KNOWING NUMBERS


Simple arithmetic entails that there are numbers, two of them
between 3 and 6, for example, and infinitely many thereafter.
But how do we know that there are any such things? If numbers
exist, they would surely be abstract, and there seems to be no way
of explaining our knowledge of abstract objects, short of postulating
some supernatural mode of apprehending them. Numbers in particular
do not emit or reflect signals, they leave no traces, their behavior
causes no phenomena from which their existence may be inferred.
Although it is plausible that arithmetic is indispensable to empirical
science, this has been disputed, and even if true, its significance for
the truth of arithmetic and the existence of numbers is moot, as one
might take an instrumentalist view of arithmetic in science.¹
Considerations of this sort have led some people to conclude that we do not
have knowledge of numbers, and to seek to accommodate the prevalence
and utility of arithmetic in one way or another within a
nominalist framework.


¹ Hartry Field has argued against the claim that arithmetic is indispensable to
science; see Science without Numbers (Princeton University Press, 1980). Some accept
the claim but dispute its epistemological significance: Penelope Maddy,
“Indispensability and Practice,” this JOURNAL, LXXXIX, 6 (June 1992): 275-89, and Elliott
Sober, “Mathematics and Indispensability,” Philosophical Review, CII (1993): 35-57.
Michael Resnik defends both the claim and its significance; see Mathematics as a Science
of Patterns (New York: Oxford, 1997). The indispensability criterion is stated in
Hilary Putnam, “Mathematics without Foundations,” in Mathematics, Matter, and
Method: Philosophical Papers, Volume I (New York: Cambridge, 1967). Putnam
attributes the criterion to W. V. Quine.


0022-362X/01/9801/5-18 © 2001 The Journal of Philosophy, Inc.

Nominalism is the view that there are no abstract entities.² Nominalism
about arithmetic is the view that either numbers are not
abstract, or arithmetical sentences, correctly interpreted, do not
entail the existence of numbers, or arithmetic is false. Of these three
brands of nominalism, the first is not popular,³ and so I shall say no
more about it. The task for the second brand is to find a plausible
interpretation of arithmetic that maintains truth and avoids commitment
to abstract entities.⁴ A route favored recently is to go modal:
interpret arithmetical statements as about possibilities or possible
concrete objects. A common feature of these attempts is that under
the chosen interpretation, the proposition expressed by an arithmetical
sentence differs from anything we have in mind when using the
sentence.⁵ So this brand of nominalism is prima facie implausible,
and it leaves open the question of the truth of the propositions we
actually have in mind when using arithmetical sentences.

The third brand of nominalism about arithmetic is fictionalism.
According to fictionalism, arithmetical sentences say just what we
normally think they say, and our arithmetical theorems have the
existential consequences we normally think they have, but arithmetical
theorems with existential consequences are one and all false.⁶
There are infinitely many primes? False! 2 + 1 = 3? False! This is
uncomfortable. Simple numerical equations are among the propositions
that seem most obviously true. There are various emollients to
alleviate discomfort: arithmetic does not have to be true to be useful,
and we can specify a kind of correctness without appeal to truth, namely


² What this comes to is not clear, as the abstract-concrete distinction is hazy. But
it is widely assumed that numbers would have to be abstract. For the difficulty of
drawing the abstract-concrete distinction, see Bob Hale, Abstract Objects (Cambridge:
Blackwell, 1987), chapter 3.

³ The only one I know of who might qualify as a nominalist of this kind is Bishop
Berkeley, who claimed that “numbers are nothing but names”; see Philosophical
Commentaries (New York: Garland, 1989), § 763; cf. § 803. It is not clear what he would
have said about numbers beyond those for which a name has been uttered or
written down.

⁴ This is dubbed “the hermeneutic strategy” in John P. Burgess and Gideon N.
Rosen, A Subject with No Object (New York: Oxford, 1997).

⁵ For example, on one nominalist interpretation of arithmetic, an arithmetical
sentence S is to be understood roughly as follows: there could be a model of the
second-order Dedekind-Peano axioms for arithmetic and necessarily, for any model
M of those axioms, S is true in M. Clearly, only someone with some knowledge of
mathematical logic would even entertain a thought of this kind. For details, see
Geoffrey Hellman, Mathematics without Numbers: Towards a Modal-Structural
Interpretation (New York: Oxford, 1989). For an alternate nominalist view, see Charles
Chihara, Constructibility and Mathematical Existence (New York: Oxford, 1990).

⁶ See Field; and David Papineau, “Knowledge of Mathematical Objects,” in A. D.
Irvine, ed., Physicalism in Mathematics (Boston: Kluwer, 1989), pp. 154-81.

derivability from axioms. Moreover, there are various reconstructive
strategies⁷ for finding a nominalistic statement capturing “the grain
of truth” in a theorem of arithmetic. But this satisfies only a lucky few.
Surely, some misunderstanding is involved in disbelieving that
2 + 1 = 3, for example? Thus we may be trapped, stuck at one or another
unsatisfactory position or oscillating between them.

The aim here is to show that this trap is avoidable, by arguing for
the possibility of a naturalistic account of knowing numbers, without
taking them to be concrete objects. I shall try to show that cognitive
science already provides some of the resources for such an account.
The numbers I have in mind are the finite cardinals: some we know
by acquaintance and some by description. The paper takes these
cases in order, and then something is said about knowing finite
numbers more abstract than cardinals. But first I want to say what
cardinal numbers are, and to present a general setting for the
epistemology to follow.


I. WHAT ARE CARDINAL NUMBERS?

Cardinal numbers are properties. In my view, they are properties of
sets, but they might also be properties of concept extensions,
collections, pluralities, nonmereological aggregates, or some other kind of
collective, provided that collectives of one or zero items are not
excluded. I shall talk of sets for convenience, but I do not insist that
only sets have cardinal number. As long as one is clear about which
are the items in the collective to be numbered, one need not deploy
a general collective concept in framing the question: What is the
number of sheep in this flock? In fact, one can do without any
collective concept, general or specific: How many sheep are here?
Nonetheless, the answer to the question gives the cardinal size of the
flock, and that is as much a property of the flock as its monetary value
or the average age of its members.

But do any properties exist? Are properties and kinds not just a
philosopher’s fiction, or the result of a mistakenly literal reading of
certain façons de parler? This view was famously expressed by W. V.
Quine:⁸


One may admit that there are red houses, roses and sunsets, but deny,
except as a popular and misleading way of speaking, that they have
anything in common. ... [T]he word ‘red’ or ‘red object’ is true of each
of sundry individual entities which are red houses, red roses, red sunsets,


⁷ Burgess and Rosen survey the reconstructive strategies.
⁸ “On What There Is,” in his From a Logical Point of View (Cambridge: Harvard,
1953), pp. 1-19.

but there is not, in addition, any entity whatever, individual or otherwise,
which is named by the word ‘redness’, nor, for that matter, by the word
‘househood’, ‘rosehood’, ‘sunsethood’ (ibid., p. 10).


Plato⁹ may have thought that for every general term there is a
corresponding Form. Quine held that for no general term is there a
corresponding property or kind. Between these two extremes lies the
“mixed” view that for some but not all general terms does there exist
a corresponding property or kind. Although there is no decisive
argument for any of these views, I hold that the mixed view should be
our default hypothesis. It is true that we rarely assert or deny the
existence of a property or kind, as opposed to instances of a property
or kind. But although talk of this kind is rare, judgments about the
existence or nonexistence of properties and kinds are sometimes
made in science. For example, Joseph Priestley thought that all
combustible material contained phlogiston, a substance that is
liberated from the material in combustion, with the dephlogisticated
substance left as an ash or residue. Antoine Lavoisier thought that
there was no such substance as phlogiston and no such property as
being dephlogisticated.

Is this really a case of a difference about the existence of a kind or
property, rather than about the existence of instances? Of course,
they did disagree about the existence of instances. But there was
more to it than that. On Lavoisier’s view, nothing could contain
phlogiston or be dephlogisticated. Compare this with the case of a
transuranium element prior to evidence that there exist any atoms of
that element. Before 1996, no sample of the element with atomic
number 112 had been found and, as far as was known, there was none
in existence. Yet it was held that there could be some, and in fact
some was produced (by bombarding lead with zinc). This is the
current situation for the element with atomic number 114.¹⁰ Thus,
scientists view the following as an epistemic possibility: there is no
sample of element 114 but some could be produced. Although it is
also believed that there is no sample of phlogiston, it is not also
believed that some could be produced. What explains the difference?
Surely, it is not that present technology for producing phlogiston lags
behind present technology for producing superheavy elements, nor
even that there is an insuperable technological barrier, such as an

⁹ The Republic, G. M. A. Grube, trans. (Indianapolis: Hackett, 1992), 596a. X
corresponds to general term T when X is designated by the singular term formed from
T (for example, T-ness, or in Plato’s writing, the T itself).

¹⁰ By the time you read this, the claim may be out of date, but the philosophical
point is unaffected.

unattainable energy requirement, to phlogiston production. The
answer is that while there is such a kind as element with atomic
number 114 and such a property as having a nucleus with 114
protons, there is no such kind as phlogiston and no such property as
being phlogiston.

Are we being misled by popular ways of speaking here? Presumably,
we have a naive linguistic disposition to generalize existentially
on a noun phrase, and the idea that kinds and properties exist might
be nothing but an unconscious conversion of this disposition into an
ontological thesis, regardless of empirical justification. But in holding
the mixed view (that some noun phrases for properties or kinds
do designate while others do not), we clearly resist that disposition.
On the mixed view, decisions about what kinds and properties are
real are to be based on empirical findings. The extreme views, by
contrast, issue from metaphysical prejudice, as they are unlikely to be
accepted by anyone without a platonist or a nominalist bent.¹¹ For
these reasons, I am going to take the mixed view as the default
position and use it as a background premise. That of course leaves
open the question whether there are such properties as cardinal
numbers. The affirmative view is not confined to those with a general
predisposition to platonism. John Locke,¹² for example, included
number in the list of what he was prepared to call “real qualities.” But
this view needs vindicating. The challenge is to show how, if there
were cardinal number properties, we could have knowledge of them.
I shall try to meet that challenge in what follows by sketching a
naturalistic account of such knowledge.


II. ACQUAINTANCE WITH CARDINAL NUMBERS

We say that we know John Doe, but for entities other than people, talk
of knowing them is unusual and sometimes odd. But not always: on
hearing the opening bars of a symphony, one might say ‘I know that
music’; on seeing the handwritten address on an envelope, one might
say ‘I know that handwriting’. Here one claims to recognize the


¹¹ At the time Quine wrote “On What There Is,” he was not the (reluctant)
Platonist that he later became. That article was first published in 1948, a year after
the publication of “Steps towards a Constructive Nominalism,” Journal of Symbolic
Logic, XII (1947): 105-22, co-authored with Nelson Goodman. In that article they
declare their belief in the nonexistence of abstracta to be a “philosophical intuition
that cannot be justified by appeal to anything more fundamental” (p. 105).

¹² An Essay Concerning Human Understanding [1689] (New York: Oxford, 1975),
Book II, Chapter VIII, § 17. One might argue that what Locke really had in mind
here is not the number n but the property of having n atomic constituents
(“solid parts”). But this reading of the passage is dubious and cannot easily be
transferred to other passages in which Locke seems to say that number is a
primary quality.

music or handwriting, and if the claim is true one must be acquainted
with it.¹³ In these cases, acquaintance entails that one has already
heard performances or seen samples, and from this prior experience
one has acquired a concept of the music or handwriting, and an
ability to recognize the music or handwriting in other performances
or samples, where this involves thinking of them under a relevant
concept. Generalizing, I shall take it as sufficient for acquaintance
with a property that one has experienced instances of it, and as a
result one has acquired a concept of the property and an ability to
recognize other instances and to discriminate them from noninstances
(unless circumstances are unfavorable), where this involves
applying the concept exclusively to the instances.¹⁴

What does recognitional ability consist in when the property
concerned is a cardinal number? It is not enough to be able to tell
whether a given set has the cardinality. By careful counting, we can
tell whether a given set of stationary visible objects whose cardinality
lies somewhere in the 200-to-300 range has cardinality 271. That
would be mere detection rather than recognition. Recognition of n
requires that we have some sense of the cardinal size n as distinct
from its neighbors. For which cardinals do we have such a sense?
There is evidence that we have an innately given magnitude
representation of rough cardinal size, or numerosity, with a neural basis in
the inferior parietal lobes.¹⁵ The evidence comes from a variety of
sources: experiments on healthy adults and children, clinical tests on
brain-damaged patients, brain-imaging techniques, and studies on
animals from parrots to primates.¹⁶ The capacity for representing
numerosities shares a couple of features with other innate
rough-quantity senses, such as sense of duration, temperature, and pitch.
These features are known as the distance effect and the magnitude effect.
Crudely put, the distance effect is that the smaller the difference


¹³ I take it as obvious that one would not be claiming to know that very
performance of the symphony or those very ink marks on the envelope. What one claims
to know is the type rather than any of its tokens.

¹⁴ An anonymous referee points out that this account of property acquaintance
sounds perfectly acceptable to an antirealist about properties who favors
conceptualism. While this may be true, it is not in conflict with my aim at this point, which
is to argue that, if there were cardinal-number properties, we could have knowledge
of them. My reason for taking seriously talk of properties is given in the previous
section.

¹⁵ This is what Stanislas Dehaene refers to by the title phrase of his book The
Number Sense (New York: Oxford, 1997).

¹⁶ See Brian Butterworth, The Mathematical Brain (London: Macmillan, 1999),
chapters 3-6. This was published in the United States as What Counts: How Every
Brain Is Hardwired for Math (New York: Free Press, 1999). See also Dehaene,
chapters 1-3, 7, 8.

between two quantities (for a fixed mean), the harder it is to
distinguish them; the magnitude effect is that the greater the mean of two
quantities (for a fixed difference), the harder it is to distinguish
them.¹⁷ Difficulty of discrimination is measured by response times
and error rates for tasks requiring comparison of the quantities. The
pattern of response times and error rates is part of the data to be
explained; the fact that this pattern is the signature of rough-quantity
sensing supports the view that in making number judgments of the
kinds tested, subjects are sensing rough quantities.¹⁸ Putting together
the distance effect and the magnitude effect, it follows that in using
our sense of numerosity to distinguish adjacent numbers, it will be
easier for small numbers, and this is how it seems to be. There is
evidence from studies with very young children and animals for a
prelinguistic ability to discriminate cardinal numbers 1, 2, and 3. This
ability may be provided by our numerosity sense, something that is
predicted by a neural network model for this sense.¹⁹ This
prelinguistic ability is matched by an extremely fast and reliable ability in
adults to sense the cardinal numbers of sets of 1, 2, and 3 visually
presented items, known as subitizing.²⁰ So we can sense these cardinal
¹⁷ For precise statements of these features, see any textbook in psychophysics
under “Fechner’s Law” and “Weber’s Law.”

¹⁸ Thus it is a mistake to think that the only data we have are data to the effect that
we know facts of the form ‘There are n so-and-sos’ for small n. This, I hope, partly
answers the question of an anonymous referee how such data justify the claim that
we know numbers as objects. In fact, I do not want to claim that we know numbers
as objects, and if some formulations suggest otherwise, please regard that as a slip.
My claim is that a naturalistic account of knowledge of numbers is not impossible
if we take cardinal numbers to be properties.

¹⁹ Dehaene and Jean-Pierre Changeux, “Development of Elementary Numerical
Abilities: A Neuronal Model,” Journal of Cognitive Neuroscience, V (1993): 390-407.
The hypothesis that the numerical abilities of animals and infants are based on a
sense of numerosity is put forward by Dehaene in The Number Sense, among others.
This is disputed in Susan Carey, “Evolutionary and Ontogenetic Foundations of
Arithmetic,” Mind and Language (forthcoming). Carey’s view is that we have the
sense of numerosity but that performance of numerical tasks by animals and infants
uses other cognitive resources.

²⁰ Subjects are asked to indicate the number of presented items flashed on a
screen as quickly as they can. When response time is plotted as a function of
number of items viewed, there is a discontinuity in slope: almost horizontal for small
numbers (too fast for silent counting), steeper for larger numbers (matching
increments in counting time). Error rates show the same pattern. The discontinuity
persists under diverse display conditions. This is evidence for different processes of
enumeration: a fast, error-free process for small numbers, that is, subitizing, and a
slower, less reliable process for larger numbers. (The subitizing limit varies between
3 and 7, depending on type of experiment, visual display, statistical techniques used
to calculate subitizing limit, and individual differences.)

numbers, and once we have concepts for these cardinal numbers, we
can recognize instances of them and discriminate them from
noninstances (in particular, from instances of adjacent cardinal numbers),
as we do in subitizing.
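
How the distance effect and the magnitude effect combine for adjacent numbers can be illustrated with a toy calculation. The sketch below is only an illustration under a stated assumption: it scores the difficulty of telling two quantities apart as their mean divided by their difference. The function `difficulty` and this particular ratio are hypothetical, chosen merely because the score rises with the mean and falls with the difference; it is not a model taken from the psychophysics literature.

```python
def difficulty(a: float, b: float) -> float:
    """Toy difficulty score for telling a from b (a hypothetical measure).

    The score rises as the mean of the two quantities grows (magnitude
    effect) and as their difference shrinks (distance effect).
    """
    if a == b:
        raise ValueError("identical quantities cannot be discriminated")
    mean = (a + b) / 2
    return mean / abs(a - b)


def adjacent_difficulties(limit: int) -> list[float]:
    """Scores for the adjacent pairs (1, 2), (2, 3), ..., (limit - 1, limit)."""
    return [difficulty(n, n + 1) for n in range(1, limit)]


if __name__ == "__main__":
    for n, score in enumerate(adjacent_difficulties(6), start=1):
        print(f"{n} vs {n + 1}: difficulty {score}")
```

On this toy measure the score for adjacent numbers grows steadily with their size, which is the qualitative point made above: adjacent small numbers are the easiest to tell apart.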

What kind of experiences would help us get concepts of cardinal
numbers?²¹ For small nonzero cardinal numbers, our experiences of
counting would be a natural first call. But one has to be careful here.
What does counting consist in? On some definitions, counting might
prerequire concepts of cardinal numbers, in which case our
acquisition of the concepts could not be the result of counting experience.
Let us take counting to include the one-to-one assignment of words
in a stable order with a perceptually given set of entities. That is, to
be a competent counter a child must obey the stable-order principle
and the one-one principle (allowing for slips): the counting words
must be used in the same order in different counts and they must be
assigned one-to-one with the items being enumerated.²² A child who
obeys the stable-order and one-one principles may still not know what
counting is for. Even when a child knows that the correct answer to
the question ‘How many?’ is given by the last word in the count, she
may not understand the question as a request for the number of items
counted.²³ Experimental results show that children come to realize
that the last word in a correct count signifies the number of the
objects counted only after they have acquired the stable-order
principle and the one-one principle. That is, they learn the cardinal
significance of counting only after learning correct counting
procedure.²⁴ At about the same time, they come to know which cardinal
number each word in their counting list denotes.²⁵ I assume that in


²¹ It is possible that we have innate concepts of cardinals 1, 2, and 3. Experiments
of Karen Wynn suggest that infants as young as five months make numerical
predictions when numbers are limited to three; see “Addition and Subtraction by
Human Infants,” Nature, CCCLVIII (1992): 749-50. Wynn’s findings have been
replicated many times under varying conditions. On one explanation of these
findings, infants have genuine numerical beliefs, which have concepts for the
cardinals 1, 2, and 3 as components. The explanation of these findings is under
dispute, however.

²² For counting principles, see Rochel Gelman and Charles Gallistel, The Child’s
Understanding of Number (Cambridge: Harvard, 1978), chapter 7.

²³ Children at this stage sometimes treat a number word as a variable designator,
like a demonstrative, taking it to refer in a given count to the item to which it is
assigned. See Karen Fuson, Children’s Counting and Concepts of Number (New York:
Springer, 1988), and Wynn, “Children’s Understanding of Counting,” Cognition,
XXXVI (1990): 155-93.

²⁴ Wynn (ibid.) found that the children in her study acquire the cardinal principle
at around the age of 3½ years, after they have learned correct counting procedure.
This is contrary to the understanding-before-skill hypothesis of Gelman and
Gallistel, but accords with the reverse hypothesis of Fuson.

²⁵ Wynn, “Children’s Acquisition of the Number Words and the Counting
System,” Cognitive Psychology, XXIV (1992): 220-51. Wynn found that children take the
number words in their list to stand for unique cardinalities before knowing which
cardinality each one denotes. They may reach this stage through experience of the
use of number words outside counting contexts, reaching the later stage only when
they make an order-preserving association of number words in counting sequence
with cardinals in order of size, perhaps by some general analogizing mechanism.
See also Fuson, “Relationships between Counting and Cardinality from Age 2 to
Age 8,” in Jacqueline Bideaud, Claire Meljac, and Jean-Paul Fischer, eds., Pathways
to Number (Hillsdale, NJ: Lawrence Erlbaum, 1992), pp. 127-49.

the process they acquire concepts for those cardinal numbers. How
this happens is not clear. It is not improbable that part of the process
involves mentally associating representations of initial words in the
counting list with initial representations supplied by the sense of
numerosity.²⁶ An additional possibility is that in learning to count we
construct a category representation of sets of a given size, one for
each set size from 1 to 3.²⁷ These might then serve as representations
of those cardinal numbers and get mapped onto the initial
numerosity representations. Our counting experience might also lead us to
associate adding a further item to the set of those counted with
proceeding from one cardinal number to the next one up in order of
size. So our concept of cardinal 2, for example, would present it as
the successor of 1 as well as the number between 1 and 3. Thus,
experience of the counting procedure and of its use may feed into
the acquisition of concepts for cardinal numbers 1, 2, and 3, and
enable us to recognize new instances of each and discriminate them
from instances of the neighboring cardinal numbers. In this way, we
can have acquaintance with these numbers.

Do we have acquaintance with cardinal numbers greater than 3?
My inclination is to think that we do. In our experience of counting,
we repeatedly meet and notice numbers from 4 to 10, as finger
counting typically has an important developmental role.²⁸ In so
doing, we might sharpen our sense of numerosity to get senses of
individual cardinal numbers beyond 3, just as repeatedly exercising

²⁶ There is clinical evidence for neural links between the sites in the left
hemisphere of the numerosity sense and of the aural number word store (inferior
parietal lobe and perisylvian cortex). See Butterworth, chapter 4, and Dehaene, The
Number Sense, chapter 7. Presumably these links are formed when children learn the
cardinal meanings of the number words.

²⁷ Perhaps by “abstracting” from the nature of the elements of n-membered sets
and the order in which the members are given, as Georg Cantor suggested, we form
a representation of the cardinal number n; see Contributions to the Founding of the Theory
of Transfinite Numbers I (1895), Philip E. B. Jourdain, trans. (New York: Dover,
1955), §1. This is not to endorse Cantor’s view that these representations are the
cardinal numbers.

²⁸ See Fuson, “Relationships between Counting and Cardinality from Age 2 to
Age 8.” For the neuropsychological importance of finger counting, see also
Butterworth, chapter 5.

one’s visual capacity for discriminating shades of red or types of snow
sharpens our sense of those different properties or kinds. From
counting experience, we get a sharpened sense of the cardinal size 4.
Our sense of 4 may not be quite as clear and strong as our sense of
3, but the difference may not be great. Similarly, we may develop a
sharpened sense of subsequent numbers, each almost as clear and
strong as our sense of its predecessor. It is possible that there is a
relatively sudden drop in the sharpness of the sense of numerosity
thus developed, perhaps after 10, but I am not aware of evidence for
it. If the contrary is true, there would be no natural cut-off point
separating those cardinals we can recognize and those we can only
detect; consequently, acquaintance with cardinals would have to be a
matter of degree. But there is no ground for worry here, as that is
how it is anyway for acquaintance with material individuals.


III. KNOWLEDGE OF CARDINAL NUMBERS BY DESCRIPTION

Small numbers beyond 3 may be known by acquaintance. But even if
they are not, they are known by description through familiarity with
small number addition. The cardinal number 5, for example, may be
known as the size of a set composed of a triple and a pair (without
repetition or overlap). Clearly, all the numbers from 4 to 10 can be
easily and briefly described this way without assuming knowledge of
cardinal numbers beyond 3. Early addition exercises performed
using counting techniques give us knowledge of all two-addend cardinal
sums with totals up to and including 10.²⁹ Thus, for each number
from 4 up to 10 there are several descriptions of this sort (sum of
cardinal x and cardinal y) by which we know it, and we have more of
these descriptions for larger numbers, so that our richer descriptive
knowledge of larger numbers makes up for our weaker direct sense of
them.³⁰
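
The claim that larger numbers come with more two-addend descriptions can be checked with a small enumeration. The sketch below rests on assumptions not stated in the text: addends are taken to be nonzero, and the sum of x and y is counted as the same description as the sum of y and x. The function name `two_addend_descriptions` is hypothetical.

```python
def two_addend_descriptions(total: int) -> list[tuple[int, int]]:
    """All pairs (x, y) with 1 <= x <= y and x + y == total.

    Each pair stands for one description of the form
    "sum of cardinal x and cardinal y" (order of addends ignored).
    """
    return [(x, total - x) for x in range(1, total // 2 + 1)]


if __name__ == "__main__":
    for n in range(4, 11):
        pairs = two_addend_descriptions(n)
        print(n, len(pairs), pairs)
```

Under these assumptions the number of descriptions never falls as the total rises from 4 to 10, and the description of 5 as a pair plus a triple appears as the pair (2, 3).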

This kind of knowledge takes us up to 10, perhaps to 20, perhaps
even to 100. What about knowledge of much larger numbers? There
are many possibilities, but probably just a few basic kinds of
description that most of us use.³¹ As multidigit numerals (in the decimal


²⁹ For a developmental account, see Fuson, “Relationships between Counting and
Cardinality from Age 2 to Age 8.”

³⁰ The cardinal number 0 may (must?) be known by description: for example,
the number of wild tigers in Scotland, the result of taking away two things from a
set of two. Perhaps one does not think of ‘0’ as naming a cardinal number (rather
than an indicator of absence) ahead of accepting that there is an empty set. I do not
know of empirical work on this question.

³¹ I have in mind people who are not calculating prodigies but have a level of
schooling in arithmetic that is standard in North America and Europe. I am not
suggesting that this standard is very good or better than standards elsewhere.

place system) play such a big role in numerical thinking, it is natural
to look to them for clues. An obvious thought is that a multidigit
numeral is short for the corresponding polynomial description: the
cardinal 271 is the cardinal of a set composed of 2 hundreds, 7 tens,
and 1 one. This seems plausible and can clearly be extended using
greater powers of ten. But we would need to say how we know of
those powers of ten. This is easy for the case of a hundred, which is
thought of as a set of 10 tens. But for higher powers the question
arises whether our knowledge of the cardinal number really depends
on its description in terms of powers. Do we think of the cardinal
2,710,003 as the cardinal of a set composed of 2 tens-to-the-power-of-six,
7 tens-to-the-power-of-five, and so on? Do we not first think of a
ten-to-the-power-of-six as the number of things denoted by a ‘1’
followed by six ‘0’s? The actual story may be quite complicated. But
whatever it is, we clearly could come to know cardinal numbers
beyond 10 under the polynomial description, using a recursive
definition for powers of ten: a ten-to-the-power-of-two is 10 tens, a
ten-to-the-power-of-n+1 is 10 tens-to-the-power-of-n.
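
The polynomial description and the recursive definition of powers of ten can be made concrete in a short sketch. It is an illustration only, not a claim about how anyone actually thinks of numerals. The names `power_of_ten`, `polynomial_description`, and `cardinal_from_description` are hypothetical, and the sketch assumes that a ten (the first power) is already known.

```python
def power_of_ten(n: int) -> int:
    """Ten-to-the-power-of-n for n >= 1, by the recursive definition:
    a ten-to-the-power-of-(n+1) is 10 tens-to-the-power-of-n.
    The base case (a ten) is assumed to be already known."""
    if n < 1:
        raise ValueError("defined here only for n >= 1")
    if n == 1:
        return 10
    return 10 * power_of_ten(n - 1)


def polynomial_description(numeral: str) -> list[tuple[int, int]]:
    """Read a decimal numeral as (digit, unit) pairs, largest unit first.

    For '271' this gives 2 hundreds, 7 tens, and 1 one.
    """
    if not (numeral.isascii() and numeral.isdigit()):
        raise ValueError("expected a string of decimal digits")
    pairs = []
    for position, char in enumerate(numeral):
        exponent = len(numeral) - 1 - position
        unit = 1 if exponent == 0 else power_of_ten(exponent)
        pairs.append((int(char), unit))
    return pairs


def cardinal_from_description(pairs: list[tuple[int, int]]) -> int:
    """Size of a set composed of the described groups, without overlap."""
    return sum(digit * unit for digit, unit in pairs)


if __name__ == "__main__":
    description = polynomial_description("271")
    print(description)
    print(cardinal_from_description(description))
```

The recursion bottoms out at a ten, so the only prior knowledge the sketch presupposes is of the single digits and of ten itself.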

On the suggestion just made, we know larger numbers under their
descriptions as polynomials, with multidigit numerals serving as the
descriptive expressions. There is an alternative, in which the
multidigit numerals play a more intimate part in our knowledge of
numbers: the numerals are not merely the carriers of the description
but are part of the descriptive content. How is this going to avoid
circularity? If we think of the number 27176 as the number denoted by
‘27176’, that cannot be the whole story. Through our knowledge of
an algorithm for determining the numeral immediately preceding
any given multidigit numeral and our possession of a concept for the
cardinal successor function, we can amplify as follows: the number
denoted by ‘27176’ is the cardinal successor of the number denoted
by the numeral predecessor of ‘27176’. Of course, this works only for
someone who has knowledge of the number denoted by ‘27175’. But
the same kind of description is available, using the following general
schema: the number denoted by a multidigit numeral μ is the cardinal
successor of the number denoted by the numeral predecessor of
μ. Repeated application brings us back to single-digit numerals
whose denotation we know by direct association with cardinals with
which we are acquainted. In this way, circularity is replaced by
recursiveness. I am not suggesting that we actually run back in
thought from a multidigit numeral ‘27176’ to single digits in thinking
of the number it denotes. We just think of the number as the number
denoted by ‘27176’, but our understanding of this description (in
terms of numeral predecessor and cardinal successor functions)
suffices to fix its reference. That is an alternative possibility.³²
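
The recursive schema can be made concrete in a short sketch. It is
offered only as an illustration under stated assumptions: decimal
numerals without leading zeros, hypothetical function names, and a
lookup table standing in for our direct acquaintance with the
denotations of single-digit numerals. The recursion is unrolled into
loops so that long numerals do not exhaust Python's call stack.

```python
DIGITS = "0123456789"

# Stand-in (assumed) for direct association of single-digit numerals
# with cardinals with which we are acquainted.
ACQUAINTED = {digit: value for value, digit in enumerate(DIGITS)}


def numeral_predecessor(numeral: str) -> str:
    """Purely symbolic step: the numeral immediately preceding `numeral`."""
    if numeral == "0":
        raise ValueError("'0' has no numeral predecessor")
    chars = list(numeral)
    i = len(chars) - 1
    while chars[i] == "0":          # borrow through trailing zeros
        chars[i] = "9"
        i -= 1
    chars[i] = DIGITS[DIGITS.index(chars[i]) - 1]
    return "".join(chars).lstrip("0") or "0"


def cardinal_successor(cardinal: int) -> int:
    """The cardinal successor function."""
    return cardinal + 1


def denotation(numeral: str) -> int:
    """Schema: the number denoted by a multidigit numeral is the cardinal
    successor of the number denoted by its numeral predecessor."""
    steps = 0
    current = numeral
    while current not in ACQUAINTED:     # run back to a single digit
        current = numeral_predecessor(current)
        steps += 1
    value = ACQUAINTED[current]
    for _ in range(steps):               # apply cardinal successor
        value = cardinal_successor(value)
    return value
```

Nothing in the sketch is meant to model what a thinker actually does in
thinking of the number; it shows only that the description bottoms out
in single digits rather than circling.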
Probably the actual way in which we think of multidigit numbers is
more complicated than either of the two suggestions made here,
perhaps involving a mixture: the polynomial description for numbers
of two to four digits and the apparently circular description in terms
of numerals for larger numbers. Even this is liable to be an
oversimplification, as it ignores how natural-language expressions for
larger numbers affect how we think of them. On top of this there will
be variations among people and over time. But my aim here is not to
give “the correct account” of our knowledge of numbers by
description. It is merely to show that such an account is possible
without positing powers that could not be countenanced in cognitive
science.


IV KNOWING THE OBJECTS OF PURE NUMBER THEORY

Even if my case for the possibility of a naturalistic account of knowing
cardinals is accepted, one might protest that we still lack a parallel
case for the objects of number theory. Those objects are not set sizes;
they are, if anything, positions in the natural-number structure.³³ I
am sympathetic to this view. Pure mathematicians at work in number
theory are not concerned with finite cardinals as opposed to finite
ordinals and the myriad other systems that exemplify this structure;
they are concerned with the structure itself. Hence numerals and
variables in the context of pure number theory are used to refer to
(or range over) positions in the structure. But how could anyone
have knowledge of such highly abstract objects, positions in a
structure?

32 Neither of these accounts is intended to show how we might know all
finite cardinals. That would be impossible, as there is a finite upper
bound on the number of possible neural states, hence on the number of
representations (sensory or descriptive) that the mind can grasp.

33 More than one structure may be described as “the” natural-number
structure, depending on the relations considered. For simplicity, we can
take the sole relation to be the less-than relation. For structuralist
views of the ontology of pure mathematics, see Stewart Shapiro,
Philosophy of Mathematics (New York: Oxford, 1997), part II, and Resnik,
part III.

Although it lies beyond the scope of this paper to answer this
question in any detail, I think an outline is possible. First, we would
have to have knowledge of the structure itself. Assuming that we
cannot have acquaintance with more than a finite portion of an
infinite structure, knowledge of the structure must be knowledge by
description, a description whose components are inferred from other
knowledge. One possibility is that we derive a description of the
natural-number structure from our knowledge of the system of
numerals in order of precedence, which is one of the systems having
that structure. I assume that we are acquainted with some numerals
(types, not tokens) and that we know about the system as a whole
through our knowledge of algorithms for constructing the successor
and predecessor of a given numeral, and for addition on the
numerals. Writing |κ| for the number of predecessors of numeral κ,³⁴
μ + κ is the numeral constructed by |κ| applications of the successor
operation starting from numeral μ. Precedence is understood in the
familiar way: μ < ν if and only if for some κ other than ‘0’,
μ + κ = ν. Through standard arguments we know that the order of
precedence is a strict total ordering having just one element that is
not a successor and no element that is not a predecessor. In fact, it is
a well-ordering, because from any numeral μ one can reach ‘0’ by |μ|
applications of the predecessor operation, and so there is no
unending sequence of ever earlier numerals; hence every nonempty set of
numerals has an earliest member. With this knowledge of the
structure, as a well-ordering with one initial position and no terminal
position, we can know a position as the nth position of the structure.
For any numeral not too long to be grasped, we can know and name
the position that it occupies and all preceding positions.
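
The algorithms on numerals mentioned in this outline can be sketched as
purely symbolic operations on strings. This is a minimal illustration,
assuming decimal numerals without leading zeros; all function names are
hypothetical, and the bounded search in `precedes` is one way among
others of checking the existential condition.

```python
DIGITS = "0123456789"


def numeral_successor(numeral: str) -> str:
    """The numeral immediately following `numeral`."""
    chars = list(numeral)
    i = len(chars) - 1
    while i >= 0 and chars[i] == "9":   # carry through trailing nines
        chars[i] = "0"
        i -= 1
    if i < 0:
        return "1" + "".join(chars)
    chars[i] = DIGITS[DIGITS.index(chars[i]) + 1]
    return "".join(chars)


def numeral_predecessor(numeral: str) -> str:
    """The numeral immediately preceding `numeral` ('0' has none)."""
    if numeral == "0":
        raise ValueError("'0' has no numeral predecessor")
    chars = list(numeral)
    i = len(chars) - 1
    while chars[i] == "0":              # borrow through trailing zeros
        chars[i] = "9"
        i -= 1
    chars[i] = DIGITS[DIGITS.index(chars[i]) - 1]
    return "".join(chars).lstrip("0") or "0"


def count_predecessors(numeral: str) -> int:
    """|kappa|: how many predecessor steps lead from the numeral to '0'."""
    count = 0
    while numeral != "0":
        numeral = numeral_predecessor(numeral)
        count += 1
    return count


def add(mu: str, kappa: str) -> str:
    """mu + kappa: |kappa| applications of successor starting from mu."""
    result = mu
    for _ in range(count_predecessors(kappa)):
        result = numeral_successor(result)
    return result


def precedes(mu: str, nu: str) -> bool:
    """mu < nu iff mu + kappa = nu for some numeral kappa other than '0'.

    Any witness kappa satisfies |kappa| <= |nu|, so it suffices to try
    mu + '1', mu + '2', ... for at most |nu| steps.
    """
    candidate = numeral_successor(mu)   # this is mu + '1'
    for _ in range(count_predecessors(nu)):
        if candidate == nu:
            return True
        candidate = numeral_successor(candidate)
    return False
```

On this sketch the well-ordering claim corresponds to the fact that
`count_predecessors` always terminates: every numeral reaches ‘0’ after
finitely many predecessor steps.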

That is my outline. I concede that we are still some way off from
being able to fill in the cognitive details of a naturalistic account of
our knowledge of the natural-number structure, hence of its
positions. But I am aware of no reason to think that future developments
in the empirical study of our cognitive capacities will not one day
provide the resources for a full account along the lines of the sketch
just given.


V CONCLUSION 
Among philosophical attitudes to arithmetic prevalent today is a kind
of skepticism: the theorems of arithmetic are not true, unless they are
interpreted in one or another strange way. One of the main
motivations for this attitude is the conviction that there could not be
a naturalistic mode of knowing numbers. I have tried here to rebut that
conviction. Drawing on recent work in cognitive science, I have
sketched ways in which we might have knowledge of cardinal
numbers. I have not canvassed all the possibilities, and it may be that
none of those presented here are right. My claim is only that they, or
something similar, could be right; so, I conclude, we should not
dismiss the very possibility of explaining knowledge of cardinal
numbers within a scientific framework.

34 I assume that ‘0’ is the first numeral. So |κ| is the cardinal
denotation of κ.

35 This terminology is slightly misleading. An initial position is one
that is not the successor of another position. So the position of any
limit ordinal is initial (as is the position of 0).

What has prevented people from seeing the kind of possibilities
outlined here? There are probably several obstacles. Here are my
suggestions. One has to do with the word ‘object’. If cardinal
numbers exist, they are objects, we say. That dictum is a truism if the
word ‘object’ is used to mean ‘entity’. But cardinal numbers, being
properties of sets, are very unlike prototypical objects, which are
bounded, perceptible, spatiotemporal continuants. Moreover, the word
‘object’ is often used contrastively with ‘property’. So the dictum may
obscure the fact that cardinal numbers are properties and incline one
to think that, if there were such things as cardinal numbers, they would
have to be something like disembodied billiard balls. A second obstacle
may be a nominalist reluctance to accept the existence of properties.
A third obstacle may be that the empirical study of cognitive
capacities for number knowledge is relatively young, though it is now a
vibrant research subject. Finally, philosophers of mathematics have
shown a near total disregard for the empirical findings, despite a
willingness to pronounce on cognitive matters.³⁶ I hope that this
paper shows the merit of abandoning that attitude.

MARCUS GIAQUINTO 

University College London 


36 A notable case is Philip Kitcher's claim that children come to learn
the meaning of ‘number’ and to accept basic truths of arithmetic “by
engaging in activities of collecting and segregating”; see The Nature
of Mathematical Knowledge (New York: Oxford, 1984), chapter 6. Kitcher
gives no reference to empirical work on children's cognitive
development, and his view is not borne out by the findings earlier
published in Gelman and Gallistel.


THE RATIONALITY OF SCIENCE AND THE RATIONALITY
OF FAITH*


If science is rational, why is it so rare in human history? If faith is
irrational, why have the great majority of people always preferred
traditional beliefs to new ones obtained by independent, critical
investigation? The prevailing answer is that nonscientific beliefs
derive primarily from nonrational causes, and, unfortunately, the
history of the world is largely dominated by unreason. I think that this
answer is false. I see the conflict between science and faith as
resulting not from the struggle of rational thought against irrational
forces of conformity, but from a systematic tension between two aspects
of rationality itself, which I call simply subjective and objective. I
make two main controversial claims. I claim that most people rationally
(in the more basic, subjective sense) ought to believe in their local
traditions, because that is what all of the people around them believe,
and few individuals are in a total epistemic situation from within which
they can reasonably contradict their neighbors. I also claim that most
scientists and other modern intellectuals ought rationally (in the
same sense) not to believe in their own theories, though it is often
good for others in the long run when they do. This is not to deny that
scientific research is objectively a rational process, that is, one that
leads to true beliefs for people in general. But I claim that this
requires a kind of epistemic altruism on the part of experimental
thinkers, in that their new ideas are ordinarily less likely (from the
thinkers’ own, internal points of view) to be correct than are the
opinions they have received from others.
My argument has the following structure. In section I, I specify the
difference between my conceptions of subjective and objective
rationality, and make a few other preliminary distinctions that are
needed for a clear exposition of my main points. In section II, I defend
as subjectively rational a broad principle of deference to the beliefs
of others, including most traditional religious beliefs. I also consider
two main objections, one derived from David Hume and one from
René Descartes. In section III, I discuss the combination of objective
rationality and subjective irrationality which accounts both for the
great success of science, and for its being so rare. In section IV, I
consider briefly whether this new view leads into paradox when
applied to itself.

* I would like to thank Larry Blackman, Ellery Eells, Berent Enç,
Richard Fumerton, Alan Sidelle, Walter Soffer, Dennis Stampe, and
Stephen Sullivan for helpful comments on this paper. I would also like
to thank SUNY-Geneseo for its support through a Presidential Summer
Fellowship.


I DISTINCTIONS 

There are three intuitive distinctions which I want to discuss in
advance of my main arguments: subjective versus objective rational
belief, first- versus second-order rational belief, and rational belief
versus knowledge. There is nothing very new or very technical here;
I just want to avoid certain natural misunderstandings.

Pure rationality itself is a single, formal thing: a connection
between bodies of evidence and sets of beliefs, through chains of proper
inference. But a useful distinction can be drawn between different
applications of this single concept, in terms of the available bodies of
evidence. I say that a belief p is subjectively rational for a person S
at a time t if p is what S rationally ought to believe at t, given all
of the evidence available to S at or before t. I do not mean whatever
seems to the person to be rational. I mean what someone really ought to
believe, not just what he thinks that he ought to believe, given all of
the evidence that he has had so far. This strong conception will rule
out as fully rational some people who have made errors in reasoning
in the past that they cannot now correct. Weaker notions of subjective
rationality are no doubt worth exploring, but one of my goals here is
to make a strong defense of traditional believers, so I am requiring
them to be completely “epistemically responsible” agents, beyond any
relevant criticism respecting the rationality of their background
beliefs.¹

I shall not try to define ‘rationally ought to believe’ more precisely;
common intuitions should be good enough.² My immediate point is
just that different people frequently have access to different pools of
evidence, so it can be equally rational for them to come up with
different beliefs. All such beliefs, if they are properly inferred from
all the evidence available, will be subjectively rational for the people
who hold them.

1 Some writers have used the terms ‘objective’ and ‘subjective’ to mark
the difference between really and apparently rational belief, that is,
what merely seems to someone to be rational belief. Richard Feldman has
argued plausibly that this distinction tends to collapse when looked at
closely, unlike a similar distinction in ethics between really and
apparently right action; see “Subjective and Objective Justification in
Ethics and Epistemology,” Monist, LXXI (1988): 405-19.

2 A quick guess would be that S ought rationally to believe p just in
case p has a subjective probability for S of greater than 50%. That is,
p should be more likely true than not, from S’s point of view. But this
may be too low, or too uniform, a standard for belief, and in any case
the proper definition of subjective probability is also controversial.
My subjectively rational belief is approximately what William Alston
means by deontologically justified belief; see “Concepts of Epistemic
Justification,” in his Epistemic Justification: Essays in the Theory of
Knowledge (Ithaca: Cornell, 1989), pp. 81-114. Alston’s conception is
much more elaborately defined, to respond to problems about which my
view is indifferent.

I also want to distinguish this private aspect of rationality from
public or objective rationality, which determines what anyone ought to
believe, given whatever evidence is generally available at the time. I
realize that this is vague, and I apologize for using the word
‘objective’ here to mean something like globally intersubjective. It
might have been better to reserve the term ‘objective rationality’ for
just the formally correct relation of belief to evidence. But I am
trying to get at what we mean when we say that some belief is rational
simpliciter, not expressly relative to one believer or a certain pool of
evidence. This common usage, in statements like ‘it is not rational to
believe that the world is flat’, seems to require an implicit reference
to the pool of evidence to which we (that is, some relevant class of
normal people) all have some sort of access. When we talk about the
evidence for a belief, as distinct from someone's evidence in
particular, this is the pool we mean.

So, relative to the premises that Socrates is a man, and that all men
are mortal, it is rational (in the pure, formal sense) to conclude that
Socrates is mortal. If a person S at time t justifiedly believes the two
premises, then it is subjectively rational for S at t to believe the
conclusion. Since it is public knowledge that Socrates is in fact a man
and that all men are mortal, we can say that it is objectively rational
(or, as we often put it, just plain rational, or rational given the
evidence) to believe that Socrates is mortal.

The Socrates example is one of deductive reasoning, a matter of a
logically necessary connection between the evidence and the
conclusion drawn. But I shall take it for granted that there is such a
thing as inductive reasoning as well. If we are not to be skeptics about
empirical beliefs in general, then we must allow as rational some
beliefs that depend on inductive (or perhaps “abductive,” or anyway,
nondeductive) generalization and prediction. Both the nature and the
justification of inductive inference are mysterious, but whatever is the
right connection between empirical evidence and empirical conclusions,
it should apply in both the subjective and the objective case. For
example, it may be subjectively (inductively) rational for one
untutored person in Europe to believe that all swans are white, and for
another in Australia to believe that all swans are black, while it is
objectively (inductively) rational, that is, rational given the pool of
evidence available to people in general, to believe that some are white
and some are black.

As the products of subjective rationality for two different agents can
conflict, so can either one conflict with those of public or objective
rationality. A certain person at a certain time may be denied important
evidence that most other people have, and therefore come to different
conclusions from theirs, without committing any errors in reasoning. Or
an individual may have some extra evidence, beyond what lies before the
general public, but which might turn out to be misleading. For example,
I may have been kidnapped and returned to Earth by aliens, with all of
the extraordinary experiences such an event is liable to involve, though
nobody else on Earth has any evidence of this fact. So only I believe
the truth about what happened to me, and everybody else who disagrees is
wrong, but that is not their fault. Or perhaps it was all an incredibly
well-wrought hoax, perpetrated on me by one Steven Spielberg at a cost
of $160,000,000, for no reason of which I could be aware. In this case,
I am the one who is wrong, but it is not my fault. In the example, we
are all reasonable people (except maybe Spielberg), all doing our best
with the imperfect evidence at hand. This is the model which I wish to
apply below to disagreements over religion and many other questions,
where it often happens that one side (or each side) considers the other
irrational. I want to argue that the main source of disagreement in many
such questions is that people on the different sides have different
total evidence. The people on both sides are drawing the conclusions
that as individuals they rationally ought to draw from their evidence.
Sometimes that evidence is misleading, in which case the conclusions
drawn, on one or both sides, turn out to be false. And this is not
anybody's fault; no one is making a mistake in reasoning. Sometimes we
all just get stuck.
The second main distinction that I want to make is between two
levels or orders of evidence, belief, rational belief, and the like,
regarding testimony. First-order evidence is whatever is available to an
individual without dependence on the word of others. Whatever
comes to me directly through my senses, memory, and faculties of
inference is my first-order pool of evidence. Second-order evidence is
whatever one can access only indirectly, through reliance on the word
of others. So if I hear someone say to me that it is raining in Paris,
then I have first-order evidence that such-and-such a person has made
the sound, ‘It is raining in Paris’, and I have second-order evidence
that it is raining in Paris. One's first-order beliefs are then those
based entirely on first-order evidence, and one’s second-order beliefs
are those based to a significant extent on testimony.³ First- and
second-order rationality may then be defined as the correct reliance
on first- and second-order evidence to form first- and second-order
beliefs.

3 I am ignoring Wittgensteinian claims about the essentially
second-order nature of meaning. It does not matter to my arguments here
whether or not first-order experiences can be articulated in a purely
private language.

A common question about second-order, testimonial beliefs is
whether they can be reduced to, or rationally derived from, more
basic first-order beliefs. Hume thought that they can; Thomas Reid
and several recent writers⁴ have claimed that they cannot. This is not
a central issue here. It is enough for my immediate purposes that
testimony be seen as a reasonable form of evidence, derivative or not:
that the fact that someone says something sometimes counts
rationally in favor of someone else believing it. Typically, at least,
this will happen when the second person has good reason to believe that
the first person is a reliable source of information, at least on the
topic in question. Thus, the efficacy of second-order evidence will seem
to depend, in practice if not in principle, on first-order evidence
about its sources.

I do not claim that this distinction between first and second
epistemic orders is at all crisp. For example, there are plenty of mixed
modes like telescopes, which can rationally be relied upon by
individuals only through a justified trust in the craftsmen who made
them, their instructions, and so on. Even our own sense organs take
on this mixed character, once they have been examined and
pronounced however good they are by reliable others, which must affect
our confidence in using them thereafter. But the distinction should
be clear enough for present purposes: the arguments that I want to
present about first- and second-order thinking are not very sensitive
to such details.

Finally, I need to make it clear that I am talking only about rational
or justified belief, as distinct from knowledge.⁵ We allow ourselves to
say that people rationally believe things that they do not know. There
are three main reasons for this. First and most obviously, knowledge
implies truth, and many justified beliefs are false. I may have believed
with good reason that my wife was cheating on me, and then
discovered (maybe only after I have strangled her) that I had been
expertly deceived toward that conclusion by my best friend; so, plainly,
I could not have known what was not true to begin with. Second, even
ignoring the issue of truth, the term ‘knowledge’ implies greater
certainty than mere belief. I may believe with good reason that Ralph
Nader will lose the presidential election in the United States in 2000,
and be right, but not know it in the sense of being truly certain.
Third, knowledge implies understanding in a way that simple justified
belief does not. I may believe with good (second-order) reason that
photons exhibit something called wave-particle duality, but I barely
understand what this claim means in terms of the theory that explains
it; hence, in a way, I do not really know that the claim is true.

4 For two main examples, see C. A. J. Coady, Testimony: A Philosophical
Study (New York: Oxford, 1992), and Tyler Burge, “Content
Preservation,” Philosophical Review, CII, 4 (October 1993): 457-88.

5 For convenience, I shall ignore the issue of probabilized beliefs or
degrees of belief, and just treat belief as an up-or-down decision.

I am particularly concerned to avoid a side issue over whether there
is such a thing as second-order knowledge, in the full sense of the
term. Plato for one, and John Locke for another, firmly deny that
mere testimony can ever provide us with real knowledge.⁶ But this
depends on their strong view of knowledge as something like
understanding, not justified true belief. Both Plato and Locke do accept
that testimony can often be used to transfer mere justified or rational
belief (true or otherwise) from one person to another. This weaker
claim is all that I need for my argument that most people do believe
what they rationally ought to believe.


II THE SUBJECTIVE RATIONALITY OF FAITH 
Sometimes, the total pool of evidence we have will make it rational for
us to override our own tentative, first-order conclusion about
something in favor of someone else’s expressed opinion. This will
typically occur when we have sufficient reason to believe that the other
person or group has a more reliable overall view of the question than we
do. To the extent that another person is an eyewitness, or an expert,
and we are not, it is likely to be rational for us to forego our own
first-order beliefs in favor of theirs. Even when our own view is
privileged in some way, we should sometimes still defer to the
opinions of others. For example, when I think, based on my own direct
experiences, that I am having a heart attack, but then my doctor
checks it out externally and tells me no, that I am only suffering from
something called “acid reflux,” then I should probably just abandon
my own prior opinion and adopt his in its place. This is because he
is much more likely overall to be right, under the circumstances, than
I am. In general, if we are rational, we should believe whatever is most
likely to be true, given all the evidence we have. It is, in principle,
irrelevant whether the initial source of this belief is our own
first-order judgment or someone else's. In a conflict of belief, if we
are justified in believing that someone else is more likely to be right
on the question than we are, then we should defer to their opinion. As
a strictly subjectively rational matter, we should “stick to our guns”
when faced with controversy only if we are justified in believing that
we are the ones most likely to be right.⁷

6 Here is Plato in the Theaetetus 201c: “suppose a jury has been justly
persuaded of some matter which only an eye-witness could know, and which
cannot otherwise be known; suppose they come to their decision upon
hearsay, forming a true judgment: then they have decided the case
without knowledge, but, granted they did their job well, being correctly
persuaded” (M. J. Levett, trans. (Indianapolis: Hackett, 1992), p. 80).
See also M. F. Burnyeat, “Socrates and the Jury: Paradoxes in Plato's
Distinction between Knowledge and True Belief,” Proceedings of the
Aristotelian Society, Supplementary Volume LIV (1980): 173-91. John
Locke agrees; in Book IV, Chapter 14 of his Essay Concerning Human
Understanding, Peter Nidditch, ed. (New York: Oxford, 1975), pp. 655-56,
he uses the term ‘probability’ to cover all justified belief short of
knowledge: “Probability then, being to supply the defect of our
Knowledge, and to guide us where that fails, is always conversant about
Propositions, whereof we have no certainty, but only some inducements to
receive them for true. The grounds of it are, in short, these two
following: First, the conformity of any thing with our own Knowledge,
Observation, and Experience. Secondly, The Testimony of others, vouching
their Observation and Experience.”

I should make it very clear that I am not suggesting that one can
think somebody else's thoughts. When I adopt through deference
somebody else's opinion, the belief that I end up with is still
definitely mine. I am not rationally out of the picture just because I
have deferred to the beliefs of another. I am still drawing the best
conclusions that I can from the total evidence available to me, which of
course includes my own direct experiences and reasoning based on
them, but also includes my experiences of other sources of
information besides myself. At the end of the day, my coming to the best
overall conclusion may entail giving up my own first-order conclusion.
But it is still me thinking, from my own traditional, first-person point
of view. So my claim implies no form of “group-think” social
epistemology, that is, one that takes social groups as basic subjects of
belief. There is no relativism, or social constructivism, or anything of
the sort involved in what I am saying. I am speaking only of the total
evidence that each individual has for his own beliefs, and claiming that
this evidence can sometimes force a reasonable person into epistemic
conformity with others around him.

Still, the principle of rational deference for which I am arguing is
very strong. It does imply that there are cases in which, if the
practical question is only what one should believe at the moment, one
might as well simply shut down one’s own epistemic faculties, and rely
reflexively on someone else instead. In general, it says that person S₁
ought to defer to person S₂ with respect to proposition p just in case
S₁ has sufficient reason to believe that S₂ is more likely than S₁ to be
right about the truth or falsity of p. But it can happen that S₁ finds
out, or at least comes rationally to believe, that S₂ is always more
likely to be right than S₁, in which case it looks like S₁ should just
adopt S₂ as an authority, and stop trying to figure things out for
himself. This may be true for a certain class of mentally retarded
people, for example, who come correctly to believe that their own
opinions (that is, their own first-order beliefs) about most things are
simply not to be trusted, and they should rely instead on some set of
trustworthy, nonretarded people around them. A larger class is children.
Almost all small children, at least up to five or six years old,
including very intelligent ones, are in this position with respect to
their parents. The children know enough to know that they are far less
reliable overall than are their parents, hence that they are rationally
best off (though perhaps not best off overall) accepting whatever their
parents say, on almost every subject.⁸ Of course, ordinary parents are
not really especially reliable on every subject, including theology,
physics, and so on. But the young child is in no position to make such
distinctions. From his point of view (that is, given all of the evidence
that he has accumulated so far), whatever his parents tell him about
virtually anything is more likely to be true than are the products of
his own, separate efforts.

7 John Hardwig argues along similar lines for a somewhat more restricted
principle of deference, to the effect that laymen can have good reason
to defer to experts, especially in matters of science. He declines to
count this reason as a kind of evidence. See his “Epistemic
Dependence,” this JOURNAL, LXXXII, 7 (July 1985): 335-49.

Consider an ordinary child's belief ın the existence of Santa Claus 
It may seem that such children must be making some kind of juvenile 
mistake in reasoning to come to such a poorly evidenced conclusion 
But from the subjectively rational point of view, there is usually 
nothing wrong with them at all They are not making any mistakes, 
given the evidence they actually have They believe in Santa Claus 
because their parents have told them that he exists, and their parents 
are the most reliable people that they know Even 1f they find the 
first-order case for Santa's existence pretty unconvincing on its face, 
and even if their older brothers and sisters have explained repeatedly 
to them what suckers they are being, the fact remains that most kids 
still have greater total reason to beheve what their parents are telling 
them, however things otherwise seem It is, ordinarily, only once the 
parents themselves have relented, that a young child gives up this 


8 There are other important features of a person's education beyond coming up 
with the most probable beliefs It ıs commonly m one’s long-term interest to 
develop one’s own critical faculties as well, for example, even at some cost to the 
correctness of their short-term beliefs Perhaps this helps explain why children go 
through phases of unreasonable defiance, starting at a very young age 
belief, however suspicious or “conflicted” he has become in the
meantime.

I am not trying to pose as a psychologist. It may be that the best
causal explanation of why children tend to accept the beliefs of their
parents involves other forces in addition to the kind of reasonable
inference that I have described. My point is not that they come to
hold such beliefs entirely because they are rational. What I am saying
is simply that many such beliefs are rational, and surely this is an
important aspect of the total explanation for their being held.

This kind of reasonable belief in Santa Claus can be clearly
distinguished from a case presented annually in the comic strip Peanuts,
where the character Linus has his own, idiosyncratic belief in what he
calls the Great Pumpkin, on the basis of no evidence whatsoever,
either first- or second-order. Every Halloween, he expects this thing
to rise out of a local pumpkin patch and bring presents to all of the
children in the world. And every year, it does not happen, and he sees
that it does not happen, and he keeps believing in the Great Pumpkin
anyway, all by himself. I say that this extreme “fideist” Linus is
subjectively irrational, where the ordinary Santa Claus believer is not.

My point applies to true as well as false beliefs, of course. For
example, I believe that the last Tsar of Russia was a man named
Nicholas, not because I have any first-hand evidence for this fact, but
just on other people's say-so. If I refused to believe in this Nicholas,
just because I have no direct evidence of his existence, then, given
what I know about the reliability of many of my sources, I would be
making a serious error in reasoning. Or, if I believed on the same
evidence that there were two such people, Nicholas and another one
named Freddie, I would also be committing an error. In such
matters—and there are many such matters—we should believe no more
or less than what we have been told.

Now I want to apply the same basic analysis to religion (more
specifically, religious faith) in general.⁹ The prevailing view is that
religious faith is an irrational (or just nonrational) alternative to
belief based on evidence. Explanations of other people's faith vary


9 Religion and faith are not coextensive. There are plenty of traditional and other
second-hand beliefs that have nothing to do with religion, and there are sources of
religious belief that are not second-order. For example, many people claim to have
their own, autoempirical religious experiences, anywhere from hearing vividly the
voice of God, to the vague “oceanic feeling” that Sigmund Freud talks about. There
are also some science-like ingredients of religion, including philosophical
discussions among theologians. But the thing that really makes religion or religious faith
what it is, an institution as opposed to a mere set of doctrines, is its transmission
through testimony from one generation of believers to another.
from hope, fear of death, and the like to political “brainwashing,” but
the common thread is that believers are persuaded by essentially
psychological or social, not epistemic, forces.¹⁰ But I say that, if
someone grows up in a society in which his parents and everybody
else believe in reincarnation, for example, then, if he is rational, he
will defer to all of his elders and believe in reincarnation, too. He may
have no direct, first-order evidence for this view (though he may or
may not have been taught that some things count as first-order
evidence for it), but this makes no practical difference. Given the
evidence that he actually has, he must conclude that those around
him are much more reliable, at least on big public issues, than he is
himself. Therefore, if he wants to believe what is most likely to be
true, he ought to accept whatever these trustworthy people, his
elders, tell him.¹¹ If what they say makes little sense to him, then that
is too bad—just as it is too bad for me that I cannot make heads or
tails of quantum mechanics.

We say that people ought to “think for themselves.” But what would
that be like in these cases? Should I do my own particle physics? If
not, then why should every member of a traditional society do his own
metaphysics? How could it be rational for him to evaluate the cosmic
situation on a purely first-order basis, and then accept his own
restricted conclusions in preference to the unanimous testimony of
the most trustworthy people he knows? To think that one can figure
this sort of thing out for himself would imply that he believes that his
own first-order epistemic machinery is somehow more reliable than


10 The biologist Richard Dawkins gives a particularly blunt and hostile statement
of this view: “I think a case can be made that faith is one of the world's great evils,
comparable to the smallpox virus but harder to eradicate. Faith, being belief that
isn’t based on evidence, is the principal vice of any religion. And who, looking at
Northern Ireland or the Middle East, can be confident that the brain virus of faith
is not exceedingly dangerous?”—“Is Science a Religion?” The Humanist, LVII
(January-February 1997): 26-29. He goes on to describe ordinary religious education as
a form of “mental child abuse.” Most opponents of religion are at least more
diplomatic.

11 Alston discusses a similar case (op. cit., pp. 95, 145-46) and argues that a person
who grows up inside a culturally “isolated community” cannot be blamed for
adopting the peculiar traditions of his elders, in part because he will never have
considered questioning them. In this weak sense, Alston allows that such people can be
“deontologically justified” in their beliefs (his overall point is that such a conception
of justification is faulty, precisely because it leads to such cases). The position that
I am suggesting here is much stronger. I say that ordinarily, a culturally isolated
individual is (from a subjectively rational point of view) positively obligated to believe
in the local traditions, even if he has considered them quite thoroughly, and even
if he directly perceives what is wrong with them. The only possible exception is if
the individual has sufficient total reason to believe that he is more reliable on the
subject than the sum of his elders.
everybody else’s. Unless he is an extremely unusual person, he has no
evidence for this at all. Even if it strongly seems to him that he has
good direct evidence for his own, different first-order view, and that
everybody else is screwing up somehow, his knowledge of their overall
epistemic record and abilities, compared to his own, should surely
force him to defer. Those few people who refuse to defer under such
circumstances can be said to suffer from the second-order form of
irrationality which is usually called arrogance. The more common
attitude toward doctrines that may not seem directly to be right, but
are adopted on good grounds of testimony, is what Christians since
Augustine often really mean by faith, and what I am calling
second-order rationality.¹²

This is what explains the great stability of religion and other
traditional beliefs. It is because human beings are usually quite
rational in the subjective sense—that is, we make about the best sense
that can be made of the total evidence we have, including
second-order evidence—that normal people growing up in traditional
societies so “uncritically” absorb the views of those who went before them.
The common prejudice among scientific Westerners, that generation
after generation of highly civilized Egyptians, Chinese, Hindus, Incas,
Moslems, and the like are all somehow deficient in their reasoning,
shows its absurdity, I think, when placed against this reasonable
alternative.

Here is a major objection, first posed to Augustine’s view by Hume,
in his famous essay on miracles.¹³ What if a religious claim is so
bizarre that the probability of even unanimous testimony in its favor
being true is lower than the probability that everybody else is simply
wrong? Hume claims that miracles are, by definition, events of very
low probability, since they violate what seem to be the laws of nature.
To be persuaded by second-order evidence of miracles, then, requires
that one assign an even lower probability to the proposition that the
testimony in question has somehow misfired, through past deceit,
ill-will, undue credulity, mistaken perception, or any other fallibility


12 Augustine makes this testimonial notion of faith a central premise of his
conversion to Christianity. In his Confessions, Book VI, Chapter 5, he says: “I began
to realize that I believed countless things which I had never seen or which had taken
place when I was not there to see—so many events in the history of the world, so
many facts about places and towns which I had never seen, and so much of what I
believed on the word of friends or doctors or various other people. Unless we took
these things on trust, we should accomplish absolutely nothing in this life. Most of
all it came home to me how firm and unshakable was the faith which told me who
my parents were, because I could never have known this unless I believed what I was
told”—R. S. Pine-Coffin, trans. (New York: Penguin, 1961), p. 117.

13 An Enquiry concerning Human Understanding (New York: Oxford, 1957), chapter X.
of our reliance on others. Hume asserts, quite reasonably, that in the
case of a conflict, the real probability of any human authority’s being
correct will never be as high as the probability that the usual apparent
laws of nature are in force.

From an external point of view, Hume is correct about these
probabilities. So his argument is valid, objectively speaking. But he is
not considering every ingredient of the subjective probabilities
involved, when an individual is actually confronted with a question like
this. In particular, he leaves out what the individual may justifiedly
believe about his own ability to figure such things out. Even if the
individual has read Hume’s argument himself, and finds it very
persuasive on its face, he does not know for sure that he has
understood it fully. If he then shows Hume’s essay to his parents, teachers,
and the like, and they all say that it does not work, this counts as a
powerful, subjective argument against it for him, even if he cannot
grasp their actual objections.

But what if the religious (or other second-hand) belief in question
contains, not an improbability, but an outright contradiction? If
rationality is supposed to be a useful thing, must not the rational
person be allowed to apply autonomously at least some basic tests for
coherence? The answer is “yes” and “no.” Rationality does plainly
require that one’s overall view of things be basically coherent. But
this does not entail that each belief that a person has must make sense
all by itself to that person, when this conflicts with rationality’s demand
that one make the best available sense of one’s total epistemic
situation. If everything that your elders ever told you had struck you as
highly improbable in the first place, then you would indeed have
good reason to doubt the next implausible suggestion that you hear
from them, based on their previous performance. But, to whatever
extent they have established their reliability (in the relevant subject
matter) to you, you have that much inductive reason to believe
whatever next thing they assert, however oddly it strikes you. If they
have established a degree of relevant reliability that is superior to
your own, as is usually the case for parents and their children, then
you can easily end up with better overall reason to defer to what they
tell you than to believe something else that appears to make far better
sense. If such a situation causes some local inconsistencies to crop up
within one's set of beliefs, this is unfortunate, but often unavoidable.

Think about the problem of evil. You grow up believing, on the
basis of the testimony of your parents and other adults, in an
all-powerful, all-knowing, and completely benevolent God. One day it
occurs to you that the existence of such a God is frankly contradicted
by the pointless suffering experienced by millions of innocent people
(and other creatures) every day. Now, what should you think? I say
that you are faced with a choice that is, in principle, still pretty simple.
Should you continue to trust your elders in this matter, or should you
trust yourself instead? Admittedly, you have found yourself in a
certain psychological state—the state of being persuaded by an
argument that a certain complex proposition is incoherent. But is this a
reliable indication of the truth? Has something suddenly made you
an expert on what is and is not actually incoherent, as distinct from
what merely seems to be incoherent? Suppose that your parents,
priests, and others now tell you that they are aware of your argument,
and assure you that it can effectively be countered (through a very
subtle rejoinder, or perhaps through some other process that goes
“beyond” argumentation). As long as you understand that your own
faculty of reason is not a superior indicator of necessities and
possibilities, then you will have little more total reason not to defer to your
parents now than you did before the problem occurred to you. This
is so, moreover, even if you turn out to have been right, and the
problem of evil is in fact objectively devastating to traditional
conceptions of God—indeed, even if you “know” that it is so, in the sense
that you are aware of the correct first-order path to that conclusion.

The same point holds outside of religious or traditional beliefs. I
have a friend—a philosopher, not a physicist—who claims to have
proven that relativity theory is incoherent. He may, for all I know, be
right. But I am sure that this person is not justified (in the subjective
sense) in holding this as a belief, if he really does. It almost does not
matter whether relativity actually is incoherent, or how good this
fellow’s argument is, objectively speaking. Once he has shown it to
the experts, and they have turned it down, he simply must defer to
them, if he is to be rational about it. I am in roughly the same
situation myself, with respect to the coherence of quantum
mechanics—except I realize that I am in no position to judge the matter, so
I just accept the experts’ view that this theory is correct. It seems
self-contradictory to me, but that is not sufficient reason for me to
disbelieve a theory, unless it is so frankly incoherent that I cannot
even frame it in my mind. And even then, it may be rational for me
to accept that such-and-such a theory, which I cannot even mentally
express, is nevertheless probably true.

There is indeed a limit to how crazy one’s beliefs can be, but it is
a pragmatic, not an a priori one. If I believed that all amounts of
money are the same, for example, then I would likely fail in life from
that belief. If I believed that hydrochloric acid is health-giving and
delicious, then I might well die. Beliefs that seriously interfere with
the conduct of ordinary life are naturally hard to establish in the first
place, and will be equally hard to maintain in the face of a recalcitrant
world. This is why many of the most bizarre-seeming tenets of other
people's religions have to do with primal history, the afterlife, and
other matters outside the scope of daily existence. Still, it is
instructive to remember what some people actually put themselves through
in terms of daily inconvenience, not to mention some extreme forms
of self-denial and even self-mutilation, in order to live consistently
with their traditional beliefs.

As John Rawls has said, the proximate goal of the exercise of reason
is a kind of equilibrium. Each thing that a rational person
experiences, first-hand or second-hand, must be processed as a piece of
evidence about the world, factoring in whatever one knows about the
reliability of its source. In order to make the best sense of the whole
collection, one may well have to sacrifice something of the best sense
to be made of some parts. What we seem to perceive with our senses
usually counts for a lot, because we have good reason (much of it
testimonial) to believe that our senses, especially our eyes, are very
reliable. What we can figure out by pure, first-order reason also
counts a great deal, for similar reasons, but not necessarily decisively
in any case where there is contrary evidence as well. And testimony
from any source not known to be deceptive also counts—sometimes
less than other kinds of evidence, depending on one's total situation
and experience with other people, but sometimes more.

Hence my point about traditional societies: when available
testimony on some point is truly unanimous, its epistemic weight can
simply overwhelm the other resources, a priori and otherwise, of the
individual, rational mind. The most intelligent people in the world
may end up believing almost anything, if it is part of an overall theory
of the world that makes the best sense out of each individual's total
experience, including everything that other people say to them.
Religious faith is not, then, just an irrational “opiate of the masses.”
It is, essentially, the social realization of second-order rationality.
These beliefs may have arisen initially through people's desire to have
a comforting story about death, to control others, and so on—or
perhaps some people really witnessed miracles. But once these beliefs
have been established, no further “backwardness” of any kind is
required for their maintenance, and no political sort of repression,
either. Once the process takes sufficient hold in any society, it will
tend to persist indefinitely, from each generation to the next, as new
members are raised into it. Of course, it is bound to happen that
some people grow up deferring to beliefs that they do not find very
plausible, or even fail entirely to understand, like many Catholics do
not understand the doctrines of the trinity and transubstantiation.
Such obscure beliefs might possibly be viewed as errors that have
gotten entrenched, and that the epistemic community simply has no
way of correcting from inside.¹⁴ I am not denying that it is a bad thing
when mistakes or inconsistencies get passed on like this. My point is
that when this happens, it is not because the people involved are
irrational, in the subjective sense. It is because they are rational.

Unfortunately, too, a certain amount of flat-out intellectual
repression does seem to make sense from within a traditional society. If all
the members of a community are forced by rationality to adopt some
belief, due to the great epistemic weight of testimony in its favor, then
anyone who attacks that belief is ipso facto behaving unreasonably. It
is not just raw power, then, which argues for the suppression of
opposing views within traditional societies. Since the dissenting views
themselves are rationally wholly unbelievable (though not for
intrinsic reasons), it is hard to see from inside such a community why
reasonable people should have to put up with them—especially in
legislatures, schools, and other public theaters, where everyone
agrees that reason ought to rule. Even in a nontraditional, modern
society like ours, such repression may seem appropriate where
unanimity is nearly total. For example, those who now allege that the
Holocaust never took place are (and arguably should be) denied
nearly all forms of public support for the dissemination of their view,
not simply because the view is false, or because it is nasty, but because
such a view can only be held irrationally, in light of the great mass of
physical and especially testimonial evidence to the contrary.

Here is a second objection, or a possible restriction. My account of
rational deference so far assumes something close to unanimous
testimony on the part of one's elders or one's whole community. But
many religious and other traditional believers maintain their views
despite conditions of controversy, even as members of tiny minority
groups. Can at least this kind of deference—to local authorities, in
the face of known disagreement from others—be counted as
irrational? I say perhaps, but not necessarily. It will depend on the immediate
epistemic conditions for the individual believer. For example, if a
child is raised by evidently highly trustworthy parents and others
whom he knows, both to accept some minority belief and strongly to
distrust the majority that disagrees, then that person's belief may be
rationally stable for a long time—perhaps his whole life if he is
sufficiently “protected” from the outside world. This is certainly a


14 The process can be interrupted by an external force, of course, such as a
conquering army. But even then, traditional beliefs are terribly hard to extinguish,
and frequently outlast, or even absorb, such conquerors.
common pattern with minority religious groups like the Amish or
Hasidic Jews, or other fairly self-contained epistemic communities.
But there is also a tendency, perhaps a stronger one in very mixed
societies like ours, for such minority beliefs to break down (that is, to
stop being rationally justified) under the pressure of conflicting
testimony from “outsiders” who have nevertheless shown themselves
to be reliable sources—at least, not too much less reliable than the
believer’s original community. Travel, military service, and college
education have long served this dissolutive role, by confronting
previously sheltered individuals with evidence that many nonbelievers
can in fact be trusted.

Descartes, experienced himself in all three modes of outside
contact, raises this issue of conflicting testimony in the Discourse. He
claims that such conflicts of authority tend to cancel out the total
weight of testimony in those situations, throwing the individual back
on his own resources.¹⁵ For example, if a third of my elders say that
Odin is the king of the gods, and another third say that it is Zeus, and
the last third say that there are no gods at all, then obviously I cannot
take all of these propositions equally on faith. Let me then just ignore
the whole lot of them, and do what I can to figure the situation out
for myself. Perhaps I shall end up agreeing with one side in the
ongoing dispute, or perhaps I shall come up with my own new theory,
for example, that there is only one god, or that the gods are
democratic. In any case, according to Descartes, I should rationally come
to believe whatever theory makes the most sense to me on first-order
grounds, since the pool of second-order information I received was
effectively nullified by the contradictions it contains.

I disagree with Descartes about this.¹⁶ I think that the rationally
correct procedure is to assemble all the candidate theories one can
find, and then place the greatest faith in the one most likely to be
true, all things considered. One’s own first-order view of things is
never more than one of these choices. And when it is opposed to the
unanimous testimony of his elders, one’s own view is almost always, as


15 See Discourse on the Method, in The Philosophical Writings of Descartes, Volume 1,
John Cottingham, Robert Stoothoff, and Dugald Murdoch, eds. and trans. (New
York: Cambridge, 1988), part one. Also at the beginning of the Meditations, he says:
“considering how many diverse opinions learned men may maintain on a single
question—even though it is impossible for more than one to be true—I held as
well-nigh false everything that was merely probable”—The Philosophical Writings of
Descartes, Volume 1, p. 115. Descartes sees the need for a first-order criterion of
rational belief (ultimately clear and distinct perception) as the only alternative to
skepticism.

16 But I do agree with Hardwig (op. cit., p. 343).




I have said, less likely to be true. But this is equally the case if one’s
own view is opposed by the divided testimony of his elders, because he
is still less likely to be right than any of them. Here is a simple
example. Plato says X about some philosophical issue, and Aristotle
says Y. The best thing I can figure out on my own is Z. Now, what
should I believe? Certainly not Z, unless I have good reason to think
that I am as reliable a philosopher as either of the other two. From
a strictly rational point of view, I should conclude that the correct
answer is likely to be X or Y (or something else that I am not aware
of), and acknowledge that I am not in a position to decide between
them. My own view may be interesting to me, and I may gain in
various ways from the exercise required to produce it, but I am not in
a position to judge that my own, personal opinion has any greater
virtue than its plausibility to me.

The inductive conformism that I am recommending does have
some limits. I do not wish to suggest that a person should
automatically believe whatever the mere majority around him believe. If more
philosophers believe in theory A than theory B, this may make theory
A somewhat more likely to be true, ceteris paribus. But this relative
likelihood does not itself suffice to meet any reasonable standard of
belief. Only when there is an established consensus on an issue is it
normally reasonable for someone to accept the judgment of others
without further question. But at the same time, it is only in the
unlikely case of a total breakdown of testimonial authority that one is
ever truly thrown back on his own epistemic resources. If the
disagreement among your elders reached the point where you had
reason to judge that none of them was more reliable than yourself,
then your own opinions could properly be seen as at least equally
worthy of belief, compared to the theories of others. But again, it is
only ultimately rational for you to prefer your own opinions if you
have reason to prefer yourself as a source of opinions. If no one in
the world, including you, stands out as a reliable authority on the
relevant questions, then the rationally best thing to believe is nothing
at all.

At the same time, there is undeniably a certain intellectual freedom
that results from such an absence of coherent authority. If, as
suggested below, it is natural for people to want a picture of the world
that is reasonably complete, at least where action is likely to be
required, then even in cases where a suspension of belief is rationally
favored, the desire to believe may take control psychologically. This
may explain why first-order reasoning seems naturally to thrive in
those complex epistemic situations where traditional authority has
broken down, and new beliefs, including quite bizarre ones, seem to
arise and flourish with only the flimsiest objective evidence in their
favor. People anxious for a theory, but deprived of all authoritative
guidance, might simply grab whatever is most psychologically
attractive, on the rationally nondecisive grounds of first-order plausibility to
themselves.

In ordinary intellectual life, where authority is less than absolute
but holds some sway, first- and second-order justifications for belief
are all mixed up. Our belief in our own intellectual powers is largely
conditioned on our accepting the authority of various other people
who tell us how reliable they think we are, and how reliable the
various other authorities are. This judgment, in turn, affects our
estimation of the reliability of each of them, and everybody else. For
us to develop the best possible picture of the world, our conceptions
of our own intellectual strengths and weaknesses must come into
equilibrium with our beliefs about everybody else's, and in turn with
our beliefs about the actual objects of everyone’s opinions.

This is why it is almost never sufficient to justify upholding a
dissenting opinion, merely to come up with a coherent first-order
explanation as to why the orthodox opinion is wrong. Someone who
believes that everyone else is a devil, for example, has a consistent
view, which accounts coherently for everyone else's testimony to the
contrary. Rationality requires, not just that one construct some
coherent account of one’s entire evidential situation, but also that one
make the best total sense of it. And it is highly unusual for the best
overall theory to be one in which first-order evidence outweighs
anything like unanimous testimony to the contrary.


III. THE OBJECTIVE RATIONALITY OF SCIENCE
We tend these days, as I have noted, to think of empirical science and
associated attitudes toward discovery and truth as normal, and of
religious belief and other kinds of “folk theory” as representing a
failure of fundamentally scientific rationality. But the truth is the
other way around. Science is rare in history, for the reason that it
requires, in effect, the suspension of ordinary individual rationality, in
the ultimate pursuit of knowledge that is largely unavailable to its
immediate discoverers. In general, for each basic scientist or other
intellectual explorer, the probability of his own new theories turning
out to be true will be far lower than the probability that the received
view of the time, or some other person’s innovation, will be proven
right. The young child who announces, on the basis of his own
observations and deductions, that Santa Claus does not exist, despite
his parents having insisted that he does, may be said to be right
in the sense that what he says is true, and is backed up with what turn
out to be good first-order reasons. But he is also wrong, in the sense
that he has no justification for preferring his own overall judgment to
that of his parents.¹⁷ The same goes for scientists and other
intellectuals who wish to challenge the beliefs of their own authorities. For
science to work requires that scientists ignore the epistemic weight of
much of what they hear from those around them, since a rational
respect for such testimony can place the subjective probability of any
very new hypothesis at close to zero.

Here is an example, simplified a bit to illustrate my point. Galileo
is a hero of science, but may not have been an altogether reasonable
man. He certainly defied his elders (if the church authorities of his
day can be counted as such), not just by disagreeing with received
opinions, but also by denying that religious or social authority should
carry any weight in matters of science. Galileo had the telescope, of
course, and access to other recent developments, as well as the fruits
of his own reasoning. But so did his opponents, who never seriously
denied that astronomical appearances were as he claimed. The main
issue between them was one of how to decide, and who gets to decide,
what these appearances amount to in the greater scheme of things.
And Galileo was, for much of his working life, openly contemptuous
of the “ignorant and superstitious” beliefs of the authorities around
him. He was ultimately brought to trial by the church, gently
imprisoned, and required to renounce his unorthodox views. Now, the
common perception is undoubtedly correct, that Galileo was the
premier founder of the modern scientific method, and for this he
certainly deserves our thanks. He also authored many particular ideas
which turned out to be right, or at least important improvements on
the going theories of his day. So we can say that his opinions were
rational ones, in the objective sense. But was he justified subjectively,
in believing these new theories himself? There are grounds to think
that he was not, because he never had sufficient overall reason to
prefer his own approach to the received view of the Church. If he
thought that he did, this would have required him also to believe that
he, Galileo, was a better judge of the entire situation, encompassing
both science and religion, than were the church authorities who
argued that reliable scriptures ruled out certain of his physical
hypotheses. And it is not clear how he could have had sufficient reason
to believe this claim about himself (even if it was, objectively
speaking, largely true). He must have known that he was very bright. But


17 Again, I am only speaking of purely rational justification There may well be 
other good reasons, perhaps developmental ones, for children to be inclined to 
reject their parents’ testimony 
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unless he believed justifiedly that he was brighter than everybody else, 
or privileged in some other general way beyond the resources of his 
elders, there was no adequate internal reason for him to prefer his 
own authority, as ıt were, to theirs We are glad that he did, of course, 
because he turned out to be mght, at least approximately, both on 
substance and on scientific method But he himself could not have 
known that Far from being a martyr to reason, as he is usually 
portrayed, he was a man whose total epistemic situation. made it 
rational for him to recant when he was finally ordered to do so !? 
In the practice of science, subjective and objective rationality part 
company The fact that radical inquiry of the sort that Galileo prac- 
ticed 1s ordinarily irrational for individual thinkers makes it unlikely 
that science should ever emerge inside of a traditional society But 
the fact that empirical science works in the long run—that 1s, leads to 
more true beliefs, providing for better technology, weapons, and so 
on—makes it hard to kill once it gets started It spreads, not because 
it makes the most sense for the individuals who practice it, but 
because the societies ın which it has taken hold tend to be more 
successful than those in which it has not A scientist devotes his career 
to the development of some new theory Is it likely to be true? No 
Each individual scientist 1s probably just wasting his time, if the goal 
is merely to increase his own stock of true beliefs !? But there are 
many scientists working at once, on many hypotheses If any of them 
prove to be correct, then all of the consumers of the scientific 
product benefit, albeit at the expense of unsuccessful scientists, to the 
extent that the latter have convinced themselves of theories that will 
turn out to be false ?? Thus, scientific reasoning appears to involve a 


18 This ıs only the first layer of a full analysis of Galileo's situation In real hfe, 
Galileo was not truly alone, and his opponents were hardly unanimous 

19 Though one’s knowledge, ın the sense of understanding, 1s again a different 
matter The practice of philosophy or science may well enhance one’s understand- 
ing of the world, even while diminishing one’s chance of being merely right A 
“defiant child” like Karl Marx or Freud, for example, might be seen as having 
gained a deeper overall appreciation of his subject, while being less correct than 
prior common sense about most points of fact Such comparisons are, of course, 
1mpossible to quantify 

20 What I am saying obviously applies more directly to what Thomas Kuhn calls 
revolutionary science than to his “normal” science It 1s not, in general, the 
incremental advances, in accordance with accepted paradigms, that require sus- 
pended rationality on the part of their authors, it ıs big leaps of genrus, in the face 
of direct opposition from one’s elders In ordinary life, professional scientists and 
other academics are in fact encouraged to function semrautonomously, on the 
basis of their mastery of authorized procedures And this 1s not so different from 
the authority structure of the Church 1n Galileo’s time, which also allowed for 
widespread scholarship and speculation, in that final authonty still resides very 
firmly at the top of each profession In the recent controversy over “cold fusion," for 
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kind of epistemic altruism on the part of individual scientists, 1nten- 
tional or not?! By refusing to defer to others when it 1s subjectively 
rational for them to do so, they sacrifice the greatest likelihood of 
having true personal beliefs, for the sake of producing better beliefs 
for everybody ın the long run ?? 

This is true not just for empirical science Intellectual progress in 
general results from the same kind of altruism New ideas ın ethics, 
cultural life, and even art must be launched initially in. the face of 
such great second-order evidence to the contrary, that 1t makes little 
sense for their proponents to have any confidence at all in such 
hypotheses The autonomous thinkers themselves are likely to lose, 
from a strictly rational point of view, whenever they actually believe in 
their own new ideas But their successors, and their societies as a 
whole, gain much more when one of their hypotheses (of whatever 
sort) proves to be true or useful 7° 


example, a small group of highly credentialed university scientists was decisively 
rebuked by professional organizations (including funding authorities) when it was 
decided that they had not correctly followed authorized procedures For more on 
this case, and a far more detailed discussion of how authority actually works in 
science, see Philip Kitcher, "Authority, Deference, and the Role of Individual 
Reason,” ın Ernan McMullin, ed The Social Dimensions of Science (South Bend, IN 
Notre Dame, 1992), pp 244-71 

21 It is controversial among both psychologists and biologists whether altruism is
truly possible. Some (for example, Elliott Sober and David Sloan Wilson) believe
(following Charles Darwin) that altruism can evolve through the mechanism of
group selection, where groups of cooperatively self-sacrificing organisms are able to
outperform other groups comprising individually fitter, but less mutually coopera-
tive ones—Unto Others: The Evolution and Psychology of Unselfish Behavior (Cambridge:
Harvard, 1998). This is an inviting model for the notion of objective rationality that
I am suggesting is the basis of science, but my point does not depend on such a
theory being right. I am only claiming that science tends to progress against the
purely epistemic self-interest of individual participants, in favor of the epistemic
interests of others. As with other species, it is not required that this type of altruistic
behavior in humans be recognized as such by its performers. In any case, scientists
are usually compensated in a variety of nonepistemic ways for their efforts, so it is
arguably still in their overall self-interest to do what they do.

22 I reiterate that I do not mean to imply that it is only whole societies (as opposed
to individuals) that can know (or justifiedly believe) things, or any such radical
thesis. When I say that a society gains in knowledge from the efforts of researchers
who cannot know whether their hypotheses will be proven correct, I simply mean
that future individuals in that society can know what their forebears could not, once
an appropriate consensus is reached.

23 Not that science and similar intellectual activities are unstoppable once they
get established. Many times, societies have reached philosophical or technological
plateaus, as in ancient Rome and China or medieval Japan, where a period of
exploration gives way to some new set of orthodoxies. Even in the modern West,
various forces periodically press for slower change, consolidation, or even a return
to “basics” of various sorts.
It does appear that people naturally prefer believing something to
believing nothing. As Hume says: “Nature, by an absolute and un-
controllable necessity has determin’d us to judge as well as to breathe
and feel.”24 It is easy to see how the need for action in a largely
hostile world entails a need for some kind of theory of that world,
even if the only available theories are unjustified. Hence, rationality
itself may be of limited survival value. A creature who found it very
easy to suspend judgment whenever evidence was incomplete or
contradictory would probably be less fit for survival than another who
was more ready to jump to conclusions—for example, in the matter
of whether one was about to be eaten by a tiger.

But it is still not absolutely necessary for philosophers and scientists
to believe their own theories, or believe anything at all on a given
issue.25 Intellectuals of all sorts can distinguish between the role of a
creator or advocate of ideas, and the role of a disinterested judge of
the truth. What is a good means of intellectual production, and what
are good grounds for belief, are really very different questions. Phi-
losophers and scientists are trained to produce new intellectual ma-
terial that can be ground up and examined by others, and to engage
in controversy in a spirit of healthy competition. But a fully rational
person ought to withhold belief from hypotheses that have not yet
been established, or at least made probable, regardless of his own
personal role in generating or defending those propositions. To
think that one’s own products are especially deserving of assent—this
is where a potentially useful, speculative posture turns into hubris. It
is always possible to doubt, at least “metaphysically,” even those
thoughts which one finds most psychologically compelling. For a
normal person can resist assenting to anything he knows is improba-
ble, as we do when we find out that we are hallucinating, or that the
proverbial stick in the water really is straight. Proper internal use of
the phrase ‘it seems’ should be adequate, though it must be sincere.
For example, a raven may look totally, convincingly black to me, but
nothing forces me to conclude from this that the raven really is black,
since I always have the option of concluding, in the face of any sort
of evidence to the contrary, that it merely seems to be black.


24 A Treatise of Human Nature (New York: Oxford, 1967), I.IV.I, p. 183. Here he
means to judge in general, not to make any particular judgments. But he also
clearly states that beliefs about particular objects can sometimes be “extorted by
the present view of the object,” though he says further that this implies no “dog-
matical spirit” or conceit (ibid., I.IV.VII, p. 274).

25 Most likely, we have evolved a differential mechanism, leaving us best able
rationally to suspend judgment in those matters where action is least likely to be
required. This may explain why most of us seem to need a full set of political
opinions, but few of us feel compelled to have any belief as to whether there is life
on other planets. It may also explain why philosophers and other sages are
expected to withdraw, to some extent, from the concerns of ordinary life. In order
for our rationality to reign in all matters, this natural need to believe, based on
expected needs for action, must be to some extent suppressed.

Present-day physicists have the best possible attitude, I think, in not
expecting themselves, individually, to be the ones who will turn out to
be right. They have seen so many large and small “paradigm shifts”
occur, even within the span of a single career, that they have learned
to be quite skeptical of current theories. Still, they engage in their
work with the zeal of people very close to achieving all that they
desire. Their suspension of personal judgment does not dampen
scientific enthusiasm for them, much as this would have surprised the
ancient skeptics.26 Perhaps the difference is that individual physicists
do expect to see new pieces of the truth keep emerging through the
institutional inquiry in which they play a part, so they consider their
suspense to be essentially temporary. These scientists can see them-
selves both as producers of relatively raw ideas, and as potential
consumers of a largely finished product.

As for the ordinary, modern consumers of ideas, they must respect
the great success of science overall, but also pay attention to the
frequent overthrow of current theory. Their most reasonable attitude
bets heavily on science in the long run, but against the view that this
or that particular theory is the whole truth of the matter. Many highly
educated people come from a partially traditional background as
well, and this often creates special epistemic problems for them—for
example, in the apparent conflicts between biblical creationism and
modern geological and evolutionary theory. It may be that, as be-
tween scientific and religious authority, some such individuals will
have enough total evidence to “fall” one way or the other, while
others are left with no clear, reasonable choice at all, and may suffer
greatly from the cognitive (as well as social) strain that this creates.
Ironically, widespread attacks on the rationality of faith will tend to
push some people out of this uncertain state by second-order means
alone, while pushing others into it.


IV. CONCLUSION
Here is a final objection to my view. It can be claimed that I have no
business asserting my thesis about controversial beliefs, given the fact
that the thesis itself is novel, and bound to be controversial if dis-
cussed at all. Thus I am caught in an apparent paradox: if what I say
is true, then by my own arguments I ought not to believe it. It does
not make sense to be the only conformist, as it were, in town. But
there is really no paradox here, since I desire to assert my thesis only
hypothetically—not as a report of my own firm beliefs, but as a mere,
suggested avenue of inquiry for others to pursue and evaluate. I have
first-order reasons to believe in my conclusions, which I have given
above. If I did not find the view pretty convincing myself, I would not
have bothered writing it all down. But I really do not know what other
philosophers are going to think about these arguments in the long
run. If there are glaring errors, I may find that out pretty soon. But
if my thesis is true, I doubt that I shall ever know this, because most
other philosophers will not agree very quickly, and I already know
that my own opinion in philosophy, while modestly reliable as such
things go, is not especially to be trusted.

Perhaps the only point on which there is a true consensus among
philosophers is the principle of epistemic self-reliance itself—which is
what I have been questioning. But even if I were attacking this
principle head-on (for example, by arguing for a straight, majoritar-
ian conformism), I would still be obeying it, since this attack would be
my own idea.27 In any case, it has not been my purpose to attack the
idea of intellectual autonomy itself. Instead, I have been trying to say
what it really amounts to, and how it is best to be employed—that is,
in the experimental, “brainstorming” spirit of contemporary physics.
So my position is a little awkward, perhaps, but no more paradoxical
than that of any other intellectual who is working with the proper
attitude toward his own, tentative results.
THEODORE J. EVERETT
State University of New York/Geneseo


26 For example, Kip S. Thorne talks about the bets he makes with Stephen
Hawking over controversial physical hypotheses—Black Holes and Time Warps: Ein-
stein’s Outrageous Legacy (New York: Norton, 1994), p. 521. He does not automati-
cally bet on the truth of his own work.

27 It is more common to encounter the implicit conformist, who unquestioningly
asserts the principle of “thinking for yourself,” just because everybody else does.
The locus classicus for this point is the famous balcony scene in Monty Python’s film,
The Life of Brian, where the reluctant savior Brian commands his followers to think
for themselves, and they shout their agreement in unison.
Unto Others: The Evolution and Psychology of Unselfish Behavior. ELLIOTT
SOBER and DAVID SLOAN WILSON. Cambridge: Harvard University
Press, 1998. 394 p. Cloth $33.50, paper $17.95.


Unto Others is an intellectual joyride, by turns speculative and pioneer-
ing, cautious and carefully argued, scholarly and judicious, self-right-
eous and strident, generous and evenhanded, and defensive and
partisan in advancing its central themes: the undeserved neglect of
group selection as an important mechanism of evolutionary change
making possible the evolution of altruistic traits (chapters 1-3), the
special significance of this mechanism for the evolutionary history of
human beings (chapters 4-5), and the evolutionary considerations
favoring the inclusion of altruistic impulses among our ultimate
psychological motivations (chapters 6-10).

Elliott Sober and David Sloan Wilson have been putting group
selection’s best foot forward for quite some time now, and they
effectively confront a number of confusions about the issue, most
importantly, what they call the averaging fallacy: defining altruism and
group selection out of existence by averaging the fitnesses of genes,
traits, organisms, and the like across the various environmental cir-
cumstances they encounter (including membership in groups with
varying proportions of altruistic members), noting that it is the
entities with the highest average fitness which evolve, and ignoring the
level of biological organization at which the relevant selective pro-
cesses actually take place. Using now-familiar examples (disease vir-
ulence, female-biased sex ratios, meiotic drive, and so on), they show
how selective pressures can coexist and even conflict at different
levels of biological organization: most important for them, even an
altruistic trait conferring a selective disadvantage upon its bearers
within each isolated subgroup of a population will increase in fre-
quency in the global population through a competing process of
selection at the level of groups themselves by the operation of Simp-
son’s paradox—here that groups containing higher proportions of
altruists contribute more offspring to the next generation of the total
population even as the frequency of altruists declines within each
group—if the group benefit is sufficiently large relative to the indi-
vidual cost, if there is sufficient variation in the proportions of altru-
ists present in the various groups, and (most awkwardly) if the
population is structured into isolated groups that mix and reassort
(again with sufficient variation in the proportion of altruists within
each group) or otherwise compete in the formation of new groups. It
is disappointing that Sober and Wilson do not provide a wealth of
new empirical examples in which these conditions are clearly real-
ized, however (especially since they suggest that such examples
abound once our conceptual blinders are removed), because even
their detailed and elegant discussions of these familiar examples (and
convincing criticisms of earlier analyses of the conditions required for
group selection by Sewall Wright and John Maynard Smith) do little
to rebut the standard assessment of their position among evolution-
ary biologists that defenders have shown selection at the level of
groups to be a theoretical possibility, but one requiring such unusual
conditions as to be rare and comparatively unimportant in nature. Of
course, part of their agenda is to advertise even this much progress
for group selection to scholars outside evolutionary biology.
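
The role of Simpson's paradox in this argument can be checked with a little arithmetic. The Python sketch below is only an illustration: the two group compositions, the baseline fitness, and the cost and benefit values are assumptions chosen for the example rather than figures taken from the book under review, and the linear fitness model (each altruist pays a cost and spreads a benefit evenly over the other members of its group) is one simple way of making the verbal description concrete.

```python
# Illustrative assumptions (not from the book): baseline fitness, benefit, cost.
BASE, BENEFIT, COST = 10.0, 5.0, 1.0


def offspring(n_altruists, n_selfish):
    """Offspring left by the altruists and by the selfish members of one group."""
    others = n_altruists + n_selfish - 1  # group members other than oneself
    # An altruist pays COST and is helped only by the *other* altruists.
    w_altruist = BASE - COST + BENEFIT * (n_altruists - 1) / others
    # A selfish member pays nothing and is helped by every altruist.
    w_selfish = BASE + BENEFIT * n_altruists / others
    return n_altruists * w_altruist, n_selfish * w_selfish


def share(altruists, selfish):
    """Proportion of altruists in a collection of individuals."""
    return altruists / (altruists + selfish)


# Two isolated groups, given as (altruists, selfish); an assumed starting point.
groups = [(20, 80), (80, 20)]
next_gen = [offspring(a, s) for a, s in groups]

within_before = [share(a, s) for a, s in groups]
within_after = [share(a, s) for a, s in next_gen]
global_before = share(sum(a for a, _ in groups), sum(s for _, s in groups))
global_after = share(sum(a for a, _ in next_gen), sum(s for _, s in next_gen))

for before, after in zip(within_before, within_after):
    print(f"within group:     {before:.3f} -> {after:.3f}")
print(f"whole population: {global_before:.3f} -> {global_after:.3f}")
```

Under these assumed numbers the proportion of altruists falls inside each group and yet rises in the pooled population, because the altruist-rich group contributes more offspring. Shrinking the benefit, or making the two groups more alike in composition, removes the effect, which corresponds to the first two "if" conditions listed above.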
Furthermore, Sober and Wilson do offer an implicit reply to this
appraisal, arguing that accounts of the evolution of altruistic traits
traditionally considered widely applicable alternatives to group selec-
tion—inclusive fitness, evolutionary game theory, and selfish gene
theory—simply offer different perspectives on the very same process of
group selection described by their own multilevel selection account.
This conclusion depends, however, on their controversial conception
of a “group” as absolutely any set of individuals whose interactions
affect each other’s fitnesses, a proposal which has led many reviewers
to suggest (mistakenly, I think) that the only issues here are semantic
and that the authors try to win an argument over group selection
simply by redefining it. While Sober and Wilson are right to insist that
evolutionarily significant groups are defined by the pattern of inter-
actions among individuals they make possible (rather than spatial or
temporal boundaries), their bid for generality nonetheless rests upon
supposing that this liberal conception of group selection will be a
helpful or perspicuous way to think about the causes of evolutionary
change in an implausibly broad range of cases. They invite us, for
example, to conceive of a population randomly pairing up for rounds
of prisoner’s dilemma as a single global population that is repeatedly
divided into many ephemeral groups of size two and in which group
selection runs rampant: while cooperative behavior is exploited and
therefore selectively disadvantageous within mixed groups (pairs) of
cooperators and defectors, groups with higher proportions of coop-
erators (100% versus 50% versus 0%) ultimately contribute more
offspring to the global population (note that this is true of groups
containing 50% altruists (that is, one altruist) only because of the
greater productivity of the selfish member). Although the numbers
can be correctly crunched under this description, our understanding
of why some strategies evolve and others do not is surely better
advanced by the game theorist’s usual account, that is, by noting the
payoff to each strategy against each potential interactor and the rate
at which each is encountered—in a word, by calculating the average
payoff each strategy receives when played against the population at
large. The game-theoretic description’s explanatory advantages be-
come still more evident when we compare the two accounts’ compet-
ing characterizations of the further condition required in order for
the cooperative strategy to evolve here: for the game theorist, coop-
erators must be more likely to interact with one another (whether
because of population structure or viscosity, the association of kin,
communication, the recognition of like-minded souls, or whatever),
while Sober and Wilson must represent this same condition as an
increase of variation in the composition of their ephemeral “groups.”
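
The point that the numbers come out the same under either description can be made concrete. The Python sketch below is an illustration under stated assumptions: the payoff values T, R, P, and S, the cooperator frequency p, and the assortment parameter r (the chance that an individual is paired with its own type rather than at random) are hypothetical and are not drawn from Sober and Wilson. It computes the average payoff to each strategy twice, once in the game theorist's way and once by tallying the output of pairs treated as groups of size two.

```python
# Assumed prisoner's dilemma payoffs: temptation, reward, punishment, sucker.
T, R, P, S = 5.0, 3.0, 1.0, 0.0


def game_theory_view(p, r):
    """Average payoff to (cooperator, defector), weighting each payoff by how
    often each kind of partner is met. p is the cooperator frequency and r the
    assumed probability of being paired with one's own type."""
    meets_c_if_c = r + (1 - r) * p  # chance a cooperator meets a cooperator
    meets_c_if_d = (1 - r) * p      # chance a defector meets a cooperator
    w_c = meets_c_if_c * R + (1 - meets_c_if_c) * S
    w_d = meets_c_if_d * T + (1 - meets_c_if_d) * P
    return w_c, w_d


def group_view(p, r):
    """The same averages, obtained by tallying what each kind of pair produces."""
    f_cc = p * (r + (1 - r) * p)              # frequency of all-cooperator pairs
    f_cd = 2 * p * (1 - p) * (1 - r)          # frequency of mixed pairs
    f_dd = (1 - p) * (r + (1 - r) * (1 - p))  # frequency of all-defector pairs
    # Cooperator output comes from CC pairs (two of them) and mixed pairs (one).
    w_c = (f_cc * 2 * R + f_cd * S) / (2 * p)
    # Defector output comes from mixed pairs (one) and DD pairs (two).
    w_d = (f_cd * T + f_dd * 2 * P) / (2 * (1 - p))
    return w_c, w_d


# Random pairing (r = 0) versus correlated interaction (r = 0.6), with p = 0.5.
results = {r: (game_theory_view(0.5, r), group_view(0.5, r)) for r in (0.0, 0.6)}
for r, (gt, gv) in results.items():
    print(f"r = {r}: game theory {gt}, group tally {gv}")
```

With these assumed values the two tallies agree, defection earns more under random pairing, and cooperation earns more once like types meet often enough. Note also that a mixed pair produces S + T = 5 in total, more than an all-defector pair (2P = 2) but only because of the defector's share, which matches the parenthetical remark above. Raising r moves pairs out of the mixed category and into the two uniform ones, which is what the multilevel account describes as increased variation among groups.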
Thus, even if we accept Sober and Wilson’s suggestion of theoret-
ical equivalence between multilevel selection theory and its apparent
competitors (and are consequently reluctant to deny categorically that
these cases could be regarded as instances of something plausibly
called “group selection,” even if it is not quite the classical group
selection attacked by Williams, Maynard Smith, and others), the cen-
tral issue becomes how often it will be useful to think of this single
process as a form of group selection. Despite repeated assurances,
Sober and Wilson provide few concrete reasons to think that multi-
level selection theory will provide a broadly perspicuous or helpful
perspective, even (as the example of two-player prisoner’s dilemma
with correlated interaction illustrates) in all those cases in which
population structure plays an important evolutionary role.
Fortunately, these considerations do not undermine the interest of
the remainder of the book, for Unto Others goes on to place human
beings centerstage, and Sober and Wilson go on to argue that ances-
tral populations of human beings did satisfy the special requirements
of traditional group selection, amplified by the fact that our cognitive
sophistication enabled altruists to selectively interact more easily,
made it possible to evolve mildly altruistic systems of rewards and
sanctions that enforced more dramatically altruistic behaviors, and
rendered competing human groups increasingly distinct and inter-
nally homogeneous through the influence of cultural selection. So-
ber and Wilson’s anthropological evidence (drawn from a random
sample of anthropologists’ field descriptions of human cultures in
the Human Relations Area File) grounds arguments that are among
the most speculative in the book, but they are also some of the most
interesting, and they make at least a prima facie case deserving
serious consideration (nor do the authors claim more) for the im-
portance of group selection and the ubiquity of altruism in human
evolutionary history.

The second half of Unto Others turns to questions of psychological
(as opposed to evolutionary) altruism, contrasting the case for psy-
chological egoism (the claim that our ultimate psychological motives
are uniformly self-directed) with a pluralism that includes ultimate
(rather than merely instrumental) motives that are genuinely altru-
istic (that is, directed toward promoting the welfare of others). They
effectively demolish weak philosophical and empirical arguments on
each side (including misguided appeals to parsimony) and seek to
break what they regard as the ensuing stalemate with evolutionary
considerations: using parental care as their example, they argue that
considerations of reliability (in advancing fitness), availability (in
ancestral conditions), and efficiency (in energetic cost) jointly favor
the evolution of a motivational pluralism, including genuinely altru-
istic motivations over the egoistic alternatives.

Sober and Wilson’s evolutionary argument errs, however, in treat-
ing hedonism as the sole representative of psychological egoism. This
is defended by pointing out that versions of egoism which appeal to
desires for external goods invariably wind up invoking hedonistic
motives to explain away apparently altruistic actions, but this simply
does not support the authors’ conclusion that an equally exclusive
commitment to hedonistic motives must be the most difficult form of
egoism to refute! Indeed, the toughest opponent for the thesis of
genuine psychological altruism, especially in an evolutionary context,
is a pluralistic egoism in which our self-oriented impulses include a
wide variety of internal and external cues that are (or were in ances-
tral conditions) relevantly connected to our fitness. This way of
thinking about human motivation is familiar from the literature of
sociobiology and evolutionary psychology, and it is especially impor-
tant for the authors to consider, given that the core of their argument
for motivational pluralism is that a genuinely altruistic psychological
mechanism for ensuring parental care will enjoy greater reliability in
promoting fitness than the hedonistic alternative they consider.

Setting aside their choice of target, Sober and Wilson do not take
seriously enough the question of whether genuinely altruistic psycho-
logical motives would be easier or more likely to evolve than hedo-
nistic counterparts. After all, parental care is neither the only nor
even the most important behavior that well-adapted organisms need
to perform, so any organism that engages in it will need a mechanism
for trading off its impulses toward parental care with those directed
toward other important behaviors. While Sober and Wilson argue (in
connection with comparative efficiency) that even a hedonist re-
quires some mechanism for trading off the various kinds of pains and
pleasures that make up her motivational structure (including the
propositionally-mediated variety hedonists will need even to treat
other-directed desires instrumentally), it remains reasonable to think
it more likely for an organism to evolve motivations for an important
new behavior that fits into the (comparatively) common currency of
an existing hedonistic mechanism for trading off each of its compet-
ing motivations with one another (the most likely ancestral condition,
as Sober and Wilson recognize at least for the sake of argument) than
to evolve an entirely new mechanism for trading off a new fundamen-
tal motivation against each of them.

Ultimately, this pattern is representative of Unto Others as a whole:
Sober and Wilson offer a trail-blazing blend of insight, speculation,
evidence, argument, and conjecture which will surely fuel serious
thought and discussion in a wide variety of fields for a long time to
come, even as the raw ambition of their case repeatedly leaves its
supporting argumentation lagging behind, gasping and racing to
catch up.

P KYLE STANFORD 
University of California/Irvine 


The Bounds of Agency: An Essay in Revisionary Metaphysics. CAROL ROV-
ANE. Princeton: Princeton University Press, 1998. ix + 260 p. Cloth
$45.00.


The aim of Carol Rovane’s The Bounds of Agency is to substitute an
adequate and coherent conception of personhood for the confused
mixture of animalism and Lockean psychologism which our intui-
tions encompass. The assumption governing this effort is that despite
the lack of agreement over what follows if an entity is discovered, or
deemed to be a person, the possession or acquisition of the property
of personhood is valuable. Although it cannot be better for objects in
general to be persons rather than things, it is better for human beings
to be persons than nonpersons. Criteria for personhood have to
support the intuition of superior value; at the same time, it is risky to
load the notion with too many duties or privileges held to be know-
able a priori and necessary. Rovane occupies the middle ground by
claiming that “person” is a normative notion, though it is neutral with
respect to specific theories of obligation. Unlike P. F. Strawson’s
“descriptive” metaphysics of persons which reassured readers that
their ordinary reactive attitudes stood in no need of further justifica-
tion, Rovane’s revisionary metaphysics is intended to unnerve. Our
animalist prejudices, she thinks, interfere with our ability to recog-
nize persons where they exist. Our simultaneously held psychologistic
prejudices interfere with our ability to perceive social injustice be-
cause they lead us to focus on the wrong attributes as constitutive of
personhood. In the end, the tension between the book’s commit-
ment to ethical neutrality and its revisionary aspirations remains
unresolved. Nevertheless, there is much to learn from and to ponder
in this thoughtful and deeply intelligent study.

Rovane begins by arguing that Locke was right to insist that animal
identity—the organic integrity and persistence of a human body—is
neither necessary nor sufficient for personal identity. But our reac-
tion to thought experiments involving multiple realizations of copies
of our mental contents or losses of large chunks of them indicates
that we are unprepared for a complete break with animalism. Rather
than try to weld together parts of animalism and psychologism, or
give up the analytical effort, Rovane goes down a fourth, distinct path.
Although she gives some attention to neo-Lockean thought experi-
ments, she is little concerned with the problems of reidentification or
with inventing or solving puzzle cases. She argues that to be a person
is to be capable of engaging in “agency-regarding relations”: “[T]he
class of persons is the class of agents who mutually recognize one
another as facing, in all of their relations, the same ethically signifi-
cant choice concerning whether and when to engage in agency-
regarding relations” (49). An agency-regarding relation arises
between agents “when one agent attempts to influence another, and
yet aims not to hinder its agency.” A number of conditions must be
realized for such relations to be possible. First, there must be agents
who attempt with some success to carry out complex activities and
pursue distant goals, and who communicate with one another in
order to do so. Second, these agents must be committed to achieving
what Rovane calls “overall rational unity.” A creature with this com-
mitment strives to reduce conflicts and inconsistencies in its belief
sets, to order its preferences in noncontradictory ways, and to act in
ways consistent with its beliefs. Third, these agents must be able to
recognize other agents like themselves, and must be recognizable by
those other agents.

Persons reveal their commitment to overall rational unity in
internal deliberation and self-criticism, but also through consultation with
one another. Agents give one another advice, discuss and criticize
each other’s plans, point out limitations and possibilities that have
been overlooked and implications that were unseen, and they are
naturally receptive to taking advice. But they also threaten,
manipulate, deceive, mislead, and even mesmerize each other, and they are
also unfortunately receptive to these efforts to direct, redirect, and
hinder their agency. Generally speaking (though some minor
qualifications are recognized on pages 94-95), to the extent that one’s
relations with some entity consist of efforts to influence its behavior
by lies, threats, or the provision of incentives, one is not treating it as
a person; to the extent that some creature cannot be influenced
except in this way, it is not a person. Rational influence, by contrast,
“always proceeds by trying to find and present reasons that another
person will find compelling in light of its own rational point of view”
(137). Psychologism has some truth, insofar as the persistence of
memory, the capacity to project oneself into the future, and relative
stability of preferences and dispositions help to diagnose
personhood. But “persons endure over time only insofar as they have
commitments to unifying projects, and these unifying projects give
them renewed reasons, at the different moments of their lives, to
strive for overall rational unity over time” (165). No person can
introspect its own personhood, except by discovering its capacity to
exercise agency and its commitment to rational unity, and no
external observer can individuate persons by individuating human bodies,
not even fully grown, wide-awake bodies that have not been
subjected to commissurotomies, transplantations, implantations, graftings,
splittings, and similar procedures. We choose to engage in or refrain
from entering into agency-regarding relations, and in that sense we
decide whether certain other human animals are persons, or fully
persons, or persons all the time, and we also decide whether to be
persons vis-à-vis particular others.

Rovane denies that any specific obligations toward human beings
or persons follow from her characterization. The moral program of
an egoist or a utilitarian might require him to engage in
agency-regarding relations with no one, or with very few actual persons, in
order to realize the goal of bringing about the greatest happiness for
himself, or for the greatest number. At the same time, she claims that
nonacknowledgment or partial acknowledgment of personhood was
once common, and she implies that such patterns may be residual
even in cultures that enjoy an enlightened discourse of rights and
respect. Some humans may engage some other humans effortlessly in
rational discussion while obstructing these others or failing to assist
them in formulating their own long-term plans. They may attempt to
interfere with their agency by threats, bribes, and mesmerization.
And they may convince themselves that dealing with them otherwise
is impossible.

Sometimes, agents wish to retreat from engagement with other agents
as persons, but without facing up to the fact that they have thereby made
and carried out an ethically significant choice. They do this by denying
that the agents who occasion the choice are really persons. By denying
this, they manage to deny that these agents have any preferences in favor
of being engaged with as persons. Indeed they manage to deny that such
engagement is even possible (119).


Rovane characterizes the attitude toward semipersons as
“hypocritical.” If legislators and officials in earlier centuries had believed
without reservation that women could not think well, manage money, or
perform surgical operations, there would have been no reason to pass
new laws prohibiting the holding of certain offices and participation
in certain activities or to defend the old laws; they both acknowledged
and did not acknowledge that women were persons. Rovane does not
embrace the claim that differential restrictions on humans violate
universality requirements and are always wrong; the hypocritical
posture, she thinks, might be a psychological precondition of furthering
some moral aim (121). It is only wrong, she thinks, to deny that the
hypocritical posture is of ethical moment (6).

Despite the appeal to the higher mental faculties, Kantians will not
like this. We are told that the commitment to overall rational unity
consists in


resolving contradictions and other conflicts amongst one’s beliefs and
other psychological attitudes, accepting the implications (both logical
and evidential) of one’s attitudes, ranking one’s preferences, identifying
opportunities for action, assessing the probable consequences of
performing the actions that are open to one, determining what means are
available for achieving one’s ends, evaluating one’s ends, reassessing the
relative preferability of one’s ends in light of both the available means to
achieving them and the probable consequences of acting on them (139).


Rovane concedes that not all human animals engage in this optional
mental work, and she emphasizes that there is no categorical
requirement for them to do so. In some respects, the admissions
qualifications to personhood are set low. A person need strive only to acquire
well-ordered beliefs and consistent preferences to earn minimal
ethical consideration, not true beliefs or good preferences. As a result,
the reason for the normative carryover from personhood to even
minimal entitlement remains unclear. On the other hand (though
some effort is made to avoid overintellectualizing (146-47)), the
requirements are set high. Because Rovane is interested in the plight
of persons-who-are-not-recognized to be such, she represents it as
unconscionable that there should be persons who are fully
committed to rational unity who still receive none of the privileges of
personhood. But most human beings are acting, living, and
preserving their beings, as Baruch Spinoza says, but not devoting much effort
to uncovering and reconciling latent contradictions in their belief
systems, and most do not have careers or life-long ambitions either.
Suppose a creature brought up in constrained circumstances (in a
harem, for example, or in a family of slaves) fails to exert and to signal
its agency; it devotes no effort to cognitive maintenance and upgrades
and forms no long-term plans. It is not on Rovane’s criteria a person
and has no preference for being regarded as such; therefore no harm
is done when the agents around it refuse to engage it in
agency-regarding relations. Most Kantians think there is something wrong
with this. Now it might be a mere prejudice that most members of the
species Homo sapiens had better come up persons on any account.
Why should “person” not be a somewhat exclusive designation,
reserved for a few? But the problem is that Rovane’s conditions for
personhood are in most respects hard, while the entitlements they
generate are soft. Being a person does not carry with it an automatic
right to noninterference or regard, but only to the minimal ethical
consideration involved in someone’s having to decide that it is all
right to interfere with you or dismiss you; and having the mere
potential to be a person (being potentially educable, potentially
project oriented, and so on) seems to carry no entitlement at all. So
what the theory really says in the end is that, if you are one of those
creatures which have a commitment to rational unity and long-term
plans, you ought to be treated as such unless there are overriding
moral reasons not to. But this is not a metaphysical theory of
personhood; it is a theory concerning a widespread form of social injustice.

This argument up to here occupies roughly the first half of the
book. The second half is devoted to a further attack on animalism
and psychologism through a demonstration of the possibility of
multiple persons within a single human body, and group persons that
lack the introspective capacities of a typical Cartesian-Lockean
consciousness. Although corporate persons have been discussed
extensively in the business-ethics literature, and although multiple
personalities have come into prominence with Ian Hacking’s
skeptical treatment, there are several innovations here. One is a
redeployment of Thomas Nagel’s technique of modeling altruistic behavior
on prudential behavior in The Possibility of Altruism¹ to meet the
objection that group persons lack the introspective abilities needed
to pursue intentional goals. Group persons do not require
introspective access to shared memories or other subjective accoutrements;
they need have only a commitment to overall rational unity with
respect to some set of future-directed plans. The second is a
redeployment of H. P. Grice’s theory of communication. In prudential
behavior we anticipate communicating with our past selves by
recognizing their previous intention to affect us in a certain way, and group
persons develop analogous projective relations with their future
selves (149ff.).

The possibility that more than one full person can inhabit a single
body is given content through a sympathetic and credible recounting
of the case of Sybil, the famous “multiple,” but also through the
exploration of the idea that we individual human-sized persons are
more like Sybil than we might at first think. We typically have
commitments to many distinct long-term activities, each of which requires
a sustained coordinated effort over time, and each of which makes
different demands on our cognitive ordering and forecasting abilities
(174). Working as an academic, making progress with some musical
instrument, and keeping a household going are essentially
unconnected and even conflicting projects, and somewhat different
personalities are required for party going and for chairing meetings. Unlike
Sybil, most of us try to make a single human life out of these separate
streams of activity, and personality-drag, which Sybil did not
experience, ensures that some of our underlying frivolity or dourness gets
ported from one venue to another. With a little imagination, it is easy
to see that our sublives could take on the isolation of Locke’s
Day-person and his Night-person.

Although the endorsements on the back cover suggest that The
Bounds of Agency is a convincing or at least very serious defense of the
possibility of group and multiple persons, the book’s intention is not,
I think, to provide such a defense. Rovane insists that she is interested
only in the pure possibility of nonstandard persons and takes no
position on whether we actually meet with them in space and have to
deal with them appropriately. Very little is said about how to
recognize a group or multiple person and what to do if faced with one.
Rovane’s principle of moral neutrality implies that, if two persons
inhabited a single body, it might be morally permissible to deny both
their agency by refusing to allow them to marry two spouses or collect
two government pensions, even if their affiliation and consumption
preferences and associated plans were wholly irreconcilable. But this
is tantamount to conceding that animalist and psychologistic beliefs
do enter into our notion of personhood and can overrule the alleged
ethical criterion. And Rovane seems to be less interested in setting
out a distinct notion of the pure possibility of a nonstandard person
than in showing that we meet with indistinct versions of nonstandard
persons routinely and in space. The book’s message, contrary to
what its title suggests, is that our agency is not bounded but is
constituted by our identification with programs we selectively take up
and cast aside, and with our being seen as doing so and as capable of
doing so by others. “The ethical criterion [of personhood] suggests
that these boundaries [between individual human animals] do not
need transcending, because we already live on both sides of them and
moreover we can always redraw them” (208).

As the author points out, we are surrounded by and participate in
numerous joint enterprises: intellectual partnerships, social and
political organizations, marriages. These enterprises can bind people
into units which have their own interests and which must perform the
kind of planning and cognitive maintenance that, according to
Rovane, all real persons do spontaneously. Under ideal conditions,
nonstandard persons keep track of one another’s contributions and
thoughts about their project, have shared beliefs including beliefs
about others’ commitment to the project, and attempt to influence
each other rationally about the direction of the project (145).

This is a well-developed, complex, and gracefully written book; it is
brimming with ideas. It positions itself in the largely unoccupied
space between the morally serious but philosophically feeble
literature on the social construction of the self and the rigorous but
just-for-fun literature of splits and fusions; it is rigorous and serious,
too. It invites a reconsideration of the connection between voluntary
and structural curtailments of agency, and respect for persons,
though it deliberately downplays its substantive moral message.
Philosophers of science discovered the interest of the problems of
collective rationality and irrationality and have been exploring them
since, but standard individual projects continue to be focal in moral
philosophy. The Bounds of Agency suggests that it will prove equally
rewarding to look at the ethical features of small-scale cooperative
enterprises.

Not only Kantians but also animalists will have their reservations
nevertheless. A solid treatise by an animalist on personhood has yet to
be written, but I think it would say something like this. The significant
properties of human beings, including their capacity and tendency to
think of themselves as persons entitled to moral consideration, as well
as their capacity and tendency to regard other humans sometimes,
but not always, as persons, are explained by facts about the kinds of
animals we are and the instructions encoded within our specific type
of animal bodies. The intuition of a self-same self has a biological
function, and the ability to reidentify other human animals as the
very same ones, and to ascribe to them self-same selves despite
changes of appearance and context, is important for us. Other facts
about our animal bodies and their histories explain why interference
and manipulation and the refusal to engage in agency-regarding
relations are ubiquitous. For the animalist, our theoretical interest in
the metaphysical question of personal identity reflects the elevation
of a practical problem, critical to our lives as social animals, into a
theoretical problem calling forth our native ingenuity, our skills at
the rotation of figures, the permutation of possibilities, the detection
of isomorphisms, these transformative experiments being typical
mental amusements of our species. Rovane’s argument against
animalism is the intuition that several persons can exist in one body and
that a number of human bodies can be associated with one person.
But all proposed criteria for personhood, even the most rationalistic,
depend on our endowments as long-lived creatures that happen to
live by our wits, our hands, and our capacity for speech, as the other
animals do not. Although group and multiple persons are
theoretically possible, the theory of persons does not explain the empirically
observed fragility of groups that are committed to a shared rational
unity and the psychological difficulty of living a multiple life. We
humans can defect from any group just by walking away on our
animal feet, and we often do, but we are stuck with ourselves, and
even if doing so would lead to certain efficiencies and minimize
conflicts we cannot fragment into more persons and personalities
than our physical constitutions allow.

CATHERINE WILSON
University of British Columbia

THE PRINCIPLE OF SUFFICIENT REASON


Many attempts to refute Gottfried Leibniz’s Principle of the
Identity of Indiscernibles (PII) and Principle of Sufficient Reason
(PSR) turn upon symmetries. If a symmetric world were
possible, then PII would stand refuted, since such a world would
include distinct objects sharing all of their qualitative and relational
properties. Similarly, if there were a class of possible worlds whose
shared geometry and laws were invariant under some set of
symmetries, then PSR would stand refuted, since there would exist distinct
worlds sharing all of their qualitative and relational properties: think
here of the standard approach to Newtonian mechanics, under which
the symmetry of the theory under Euclidean isometries implies the
existence of distinct worlds with qualitatively identical geometries and
arrangements of matter, differing only as to where the matter is
placed in space.

Ian Hacking¹ once observed that symmetry-driven
counterexamples to PII are inherently inconclusive. One can always deny that
the proffered description manages to characterize any possible world
accurately, maintaining instead that the only genuine possibility in
the neighborhood arises when the qualitatively identical objects of
the original description are taken to be numerically identical.

I claim that PSR can be given a similar line of defense: a description
of a set of possible worlds which includes pairs of worlds with identical
qualitative structures can always be taken to correspond to the sparser
set of possibilities which arises when qualitatively identical worlds are
identified.

1 “The Identity of Indiscernibles,” this JOURNAL, LXXII, 9 (May 8, 1975): 249-56.

My argument has two parts, held together by an admittedly loose
interpretation of Leibniz: the later sections form a commentary on
some theorems concerning symmetries and the structure of the
spaces of physical possibilities of classical mechanics; the first few
sections are intended to set this discussion in context.


I. HACKING’S STRATEGY
Consider Max Black’s² famous challenge: “Isn’t it logically possible
that the universe should have contained nothing but two exactly
similar spheres” (ibid., p. 156)? If this is granted, then “every quality
and relational characteristic of the one would also be a property of
the other” (ibid.), and PII is soundly refuted.

Symmetry plays a crucial role in arguments like Black’s. A
two-sphere world is a potential counterexample to PII only if the spheres
are images of one another under reflection in the plane bisecting the
segment joining their centers; if they were different sizes or colors,
for instance, PII would of course be perfectly safe. Similarly, if we
describe a two-dimensional Euclidean world containing three massive
point-objects, then PII is threatened if and only if there is a symmetry
of space which maps the triangle determined by the three points onto
itself while preserving the masses of the vertices. Thus, no scalene
triangle threatens the principle, while an isosceles triangle poses a
threat if and only if reflection in its axis of symmetry interchanges
vertices of the same mass. An equilateral triangle poses a threat if and
only if some pair of vertices are the same mass (in which case masses
are preserved by reflection in the axis perpendicular to the segment
that joins these points); if all three vertices are the same mass, then
masses are also preserved by rotations of 120° and 240° about
the center of the triangle (these rotations permute the vertices
without interchanging any pair).
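
The triangle cases just listed can be checked mechanically. What follows is only an illustrative sketch, not part of the original argument: it assumes that a symmetry of a finite configuration of points can be represented as a permutation of the points that preserves every pairwise distance (for finitely many points in the plane, any such permutation extends to a Euclidean isometry) and every mass. The function names `mass_preserving_symmetries` and `threatens_pii` are hypothetical labels introduced here.

```python
from itertools import permutations
from math import dist, isclose


def mass_preserving_symmetries(points, masses):
    """Return the permutations of the point indices that preserve
    all pairwise distances and all masses (assumed representation
    of the symmetries of the configuration)."""
    n = len(points)
    found = []
    for perm in permutations(range(n)):
        keeps_masses = all(
            isclose(masses[i], masses[perm[i]]) for i in range(n)
        )
        keeps_distances = all(
            isclose(dist(points[i], points[j]),
                    dist(points[perm[i]], points[perm[j]]))
            for i in range(n) for j in range(i + 1, n)
        )
        if keeps_masses and keeps_distances:
            found.append(perm)
    return found


def threatens_pii(points, masses):
    # The identity always qualifies, so a threat to PII requires
    # at least one further, nontrivial symmetry.
    return len(mass_preserving_symmetries(points, masses)) > 1


scalene = [(0, 0), (4, 0), (0, 3)]
isosceles = [(-1, 0), (1, 0), (0, 3)]
equilateral = [(0, 0), (1, 0), (0.5, 3 ** 0.5 / 2)]

scalene_threat = threatens_pii(scalene, [1, 1, 1])
isosceles_equal_base = threatens_pii(isosceles, [2, 2, 5])
isosceles_unequal_base = threatens_pii(isosceles, [2, 3, 5])
equilateral_all_equal = mass_preserving_symmetries(equilateral, [1, 1, 1])
equilateral_one_pair = mass_preserving_symmetries(equilateral, [1, 1, 2])
```

On this representation the equilateral triangle with three equal masses has six symmetries (the identity, three reflections, and the two rotations), while giving one vertex a different mass leaves only the identity and a single reflection.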

2 “The Identity of Indiscernibles,” Mind, LXI (1952): 153-64.

More generally, suppose that we are given a description of a set of
material bodies, possessing certain properties and standing in certain
(Euclidean) geometric relations to one another. If this arrangement
is symmetrical (so that there is some nontrivial combination of
reflections, rotations, and translations which maps the arrangement
onto itself in a way that preserves properties) then, as in Black’s
example, there will be a pair of objects that share all of their
qualitative and relational properties in virtue of playing identical roles in
the pattern of relations and properties instantiated. Thus the
situation described, if a genuine possibility, constitutes a counterexample
to PII. On the other hand, if there is no such symmetry of the
situation, then PII is in effect, since each object is distinguished from
every other in virtue of some property it possesses, or the role that it
plays in the pattern of geometric relations holding between the
objects.

Hacking’s defense of PII exploits the close relationship between PII
and asymmetry. His paper proceeds via ingenious elaborations of a
simple theme: any description of a symmetric arrangement of bodies
can be taken to be a misdescription of an asymmetric state of affairs
involving fewer bodies. When Black asks Leibniz to imagine a world
containing only two identical iron spheres, Leibniz can respond (as
he did to Samuel Clarke) that “To suppose two things indiscernible,
is to suppose the same thing under two names.”³

When presented with a story about a world comprising only two
identical drops of water whose surfaces reflect one another,
Hacking’s Leibniz responds that what has in fact been described is a world
containing a single drop whose surface reflects itself. Intuitively, this
one-drop world is constructed from the ‘world’ originally described
by identifying points of space and bits of matter which are mirror
images of one another.

Opponents of PII can offer more and more elaborate worlds as
counterexamples. But because these worlds will always involve
symmetric arrangements of matter, proponents of PII will be able to
defuse the threat by identifying qualitatively identical objects to
generate asymmetric worlds that obey the principle. Implementing this
strategy may sometimes require unseemly contortions. Some truths
about the proffered symmetric world will carry over to its asymmetric
replacement (‘Each drop reflects some drop’); some truths will not
(‘No drop reflects itself’). Thus, as Hacking notes, it can happen that
the symmetric world admits a simple description, beside which any
description of the asymmetric world appears baroque. Hacking
insists, though, that this sort of thing can at most embarrass Leibniz; it
cannot force him to abandon PII.


II. ONCE MORE, ABSTRACTLY 
3 See H. G. Alexander, ed., The Leibniz-Clarke Correspondence (New York: Barnes
and Noble, 1956), LIV 6. Here ‘LIV 6’ indicates the sixth paragraph of Leibniz’s
fourth letter. Future references to this volume will be noted in the text using this
same abbreviation.

I think it helpful to have in view a more abstract description of
Hacking’s strategy. (Readers who do not like the sound of that may
prefer to skip this section.) We are interested in structures and their
symmetries.⁴ A structure consists of a set of objects, A, together with a
set, {R}, of relations on A (including, possibly, one-place relations,
namely, properties).⁵ A symmetry of a structure is a permutation of its
objects which fixes each of its relations.⁶ We equip ourselves with a
first-order language with identity which includes a predicate symbol
for each relation of our structure, a set of variables that range over
objects, and a set of constants large enough to provide a name for
each object.⁷

4 What follows is a sketch of some elementary model theory, phrased somewhat
idiosyncratically. This material can be found in, for example, Wilfrid Hodges, A
Shorter Model Theory (New York: Cambridge, 1997); see his §§1.1-1.4, 2.3, and 8.3,

This language provides two principal modes for describing our
structure. We can use variables but no names, and focus on theories:
(arbitrary) sets of true sentences about our structure which employ
neither constants nor free variables. Alternatively, we can choose to
employ names but no variables: setting up a nomenclature (an
assignment of a constant of our language to each object of the
structure), we can regard the constants as names for objects, and
formulate true statements about the objects named. In particular,
each nomenclature is associated with a complete description: the set of
all true atomic sentences about the structure relative to this system of
naming. A complete description of a structure does what no theory
about an infinite structure can do: it determines it up to isomorphism.

Let us fix a structure, and consider a nomenclature and its
associated complete description. Each permutation of the set of objects of
our structure induces a new nomenclature and a new complete
description. Symmetries are the permutations that leave invariant our
initial complete description. Thinking of symmetries as permutations
of names rather than objects, we can see that they leave invariant the
list of true sentences describing the structure.
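
For a finite structure, the symmetries can be found by brute force. The following is a minimal sketch under stated assumptions, not a construction taken from the text: a structure is represented as a collection of objects plus a dictionary mapping each relation name to a set of tuples (one-place relations are sets of 1-tuples), and the names `symmetries` and `has_indiscernibles` are hypothetical. A permutation counts as a symmetry when, acting entry-wise on tuples, it maps every relation onto itself.

```python
from itertools import permutations


def symmetries(objects, relations):
    """All permutations of `objects` (as dicts) that fix each relation.

    `relations` maps a relation name to a set of tuples of objects;
    this encoding is an assumption made for the sketch.
    """
    objects = list(objects)
    result = []
    for image in permutations(objects):
        phi = dict(zip(objects, image))
        fixes_all = all(
            {tuple(phi[x] for x in tup) for tup in rel} == rel
            for rel in relations.values()
        )
        if fixes_all:
            result.append(phi)
    return result


def has_indiscernibles(objects, relations):
    # A nontrivial symmetry moves at least one object to a distinct
    # object that plays the same role in the structure.
    return any(
        phi[a] != a
        for phi in symmetries(objects, relations)
        for a in objects
    )


# Two exactly similar spheres standing in a symmetric relation.
spheres = {"s1", "s2"}
sphere_relations = {
    "Sphere": {("s1",), ("s2",)},
    "Apart": {("s1", "s2"), ("s2", "s1")},
}

# A three-element linear order: every object plays a distinct role.
chain = {"a", "b", "c"}
chain_relations = {"Less": {("a", "b"), ("a", "c"), ("b", "c")}}

spheres_symmetric = has_indiscernibles(spheres, sphere_relations)
chain_symmetric = has_indiscernibles(chain, chain_relations)
```

The search inspects every permutation, so it is only practical for very small structures; it is meant to make the definition concrete rather than to be efficient.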

keeping in mind that the map that we are interested in below (under which $a$ is
sent to $[a]$) is a surjective homomorphism which preserves one-place relations.

5 It is sometimes convenient also to include in our structures functions on their
sets of objects. Taking these into account here would be straightforward enough,
but unwieldy, so I proceed with relations alone for the time being, but help myself
to functions later.

6 A permutation is a bijection from the set of objects to itself. Any such map also
acts on n-tuples of objects in an obvious way (that is, entry-wise), and so maps each
n-place relation (conceived of as a set of n-tuples, on which the permutation acts
element-wise) to an n-place relation. The upshot is that a permutation $\phi$ is a
symmetry if: $R(a_1, \ldots, a_n)$ if and only if $R(\phi(a_1), \ldots, \phi(a_n))$.

7 This could be a very large language: no restriction is placed on the cardinalities
of the sets of objects and relations.

The existence of a symmetry mapping object a to object b indicates
that a and b play the same role in our structure. Thus, if a and b are
distinct objects in a structure describing a possible world, then we
have a counterexample to PII. Conversely, if a structure admits no
nontrivial symmetries, this indicates that each of its objects plays a
distinct role.⁸ Thus a structure (of physical objects) contains
nonidentical indiscernibles if and only if it admits a nontrivial symmetry.

We can now describe Hacking’s strategy as follows. Having fixed an
arbitrary structure, we construct a closely related quotient structure by
identifying objects related by symmetries. We define an equivalence
relation on the set of objects of our original structure by declaring
that $a \sim b$ if there exists a symmetry of that structure, $\phi: A \to A$, with
$\phi(a) = b$; we write $[a]$ for the equivalence class of $a$, $\{b : a \sim b\}$. The set
of objects of the quotient structure is $\{[a] : a \in A\}$, the set of
equivalence classes of objects of the original structure. The quotient
structure has a relation $[R]$ for each relation $R$ of the original structure;
$[R]$ is the smallest relation such that if $R(a_1, \ldots, a_n)$ for some n-tuple of
objects of the original structure, then $[R]([a_1], \ldots, [a_n])$.
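
The quotient construction can be sketched for finite structures as follows. This is an illustration under assumptions, not a definitive implementation: structures are encoded as a collection of objects plus a dictionary from relation names to sets of tuples, equivalence classes are represented as frozensets, and the function names `symmetries` and `quotient` are hypothetical. The example is a toy version of the two-drop world, with an assumed relation name `Reflects`.

```python
from itertools import permutations


def symmetries(objects, relations):
    """All permutations of `objects` (as dicts) that fix each relation."""
    objects = list(objects)
    result = []
    for image in permutations(objects):
        phi = dict(zip(objects, image))
        if all({tuple(phi[x] for x in tup) for tup in rel} == rel
               for rel in relations.values()):
            result.append(phi)
    return result


def quotient(objects, relations):
    """Identify objects related by symmetries and push relations down.

    [a] is the set of images of a under the symmetries; because the
    symmetries form a group, these sets partition the objects.
    [R] is the image of R under the map a -> [a], which is the
    smallest relation containing ([a1], ..., [an]) whenever
    R(a1, ..., an) holds.
    """
    objects = list(objects)
    syms = symmetries(objects, relations)
    cls = {a: frozenset(phi[a] for phi in syms) for a in objects}
    q_objects = set(cls.values())
    q_relations = {
        name: {tuple(cls[x] for x in tup) for tup in rel}
        for name, rel in relations.items()
    }
    return q_objects, q_relations


# Toy two-drop world: each drop reflects the other.
drops = {"d1", "d2"}
drop_relations = {
    "Drop": {("d1",), ("d2",)},
    "Reflects": {("d1", "d2"), ("d2", "d1")},
}

q_objects, q_relations = quotient(drops, drop_relations)
the_drop = frozenset({"d1", "d2"})
```

In the quotient there is a single object, and `Reflects` holds between that object and itself, which matches the description of a world containing one drop whose surface reflects itself.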

We can use the same language to describe both structures (using
the same predicate symbol, ‘$\mathcal{R}$’, as a name for both $R$ and $[R]$).
Choosing names for our new objects generates a complete
description of the quotient structure. This will coincide with a complete
description of the original structure (relative to some nomenclature)
if and only if the two structures are isomorphic, that is, if and only if
the original structure possesses no nontrivial symmetries.

Even when the original structure does admit nontrivial symmetries,
the complete descriptions of the two structures will be closely related.
A complete description of the original structure can be transformed
into a complete description of the quotient structure by taking names
of objects related by symmetries to name identical objects.⁹ Thus, a
complete description of the original structure can be taken to be the
sort of misleading description of the quotient structure which results
when some objects are given multiple names.

We can also consider the relations between theories describing the 
two structures Some sorts of (constant-free) sentences are true of the 
quotient structure if they are true of the original structure This holds 


8 Note the disunction between the symmetries under discussion here and the sort 
of permutations that drive Hilary Putnam's famous argument about reference We 
are interested in the nontrivial automorphisms of a structure—nontrivial permuta- 
tions that fix each of the relanons of a given structure There are many structures— 
such as well-ordered sets—which do not admit any nontnvial symmetries Putnam, 
on the other hand, shows that any (reasonably rich) structure 1s isomorphic to a 
structure which has the same set of obyects but whose relations are generated by 
arbitrary permutations (and which, therefore, will typically be distinct from the 
relations of the orginal structure), see the Appendix to Reason, Truth, and History 
(New York Cambridge, 1981) 

° This would not be true if we had included nonatomic statements in our 
complete description— consider the fate of a statement like ‘a#b’ when a—b 
for sentences which are negation-free and those which mention only
one-place relations. More can be proved: if a sentence is in negation
normal form (so that all of its negation symbols apply to atomic
formulas) and all of its negation symbols apply to one-place relations,
then it is true in the quotient if it is true in the original structure.10
But not too much more: suppose that the original structure includes
an irreflexive binary relation, R, and objects a and b such that R(a, b)
and a~b; then in the quotient structure, we have [R]([a], [b]), with
[a] = [b], so the sentence '∀x ¬R(x, x)', attributing irreflexivity to R
and [R], is true in the original structure but false in the quotient
structure.
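
The failure of irreflexivity can be checked mechanically on the smallest
case of this kind. The following Python sketch is illustrative only. It
assumes that the quotient identifies objects lying in the same orbit of the
automorphism group, and that [R] holds of two classes whenever R holds of
some of their members. The function names are hypothetical, and the
brute-force search over permutations is feasible only for very small
structures.

```python
from itertools import permutations


def automorphisms(objects, relation):
    """Return every permutation of `objects` that maps `relation` onto itself."""
    objs = sorted(objects)
    found = []
    for image in permutations(objs):
        f = dict(zip(objs, image))
        if {(f[x], f[y]) for (x, y) in relation} == set(relation):
            found.append(f)
    return found


def quotient(objects, relation):
    """Identify objects related by some symmetry and lift the relation to classes."""
    autos = automorphisms(objects, relation)
    # The class of x is its orbit: everything some symmetry sends x to.
    cls = {x: frozenset(f[x] for f in autos) for x in objects}
    classes = set(cls.values())
    # Assumption: [R] holds of classes whenever R holds of some representatives.
    q_relation = {(cls[x], cls[y]) for (x, y) in relation}
    return classes, q_relation


def is_irreflexive(objects, relation):
    """True if no object bears the relation to itself."""
    return all((x, x) not in relation for x in objects)


# Two objects, each bearing R to the other; swapping them is a symmetry.
objects = {"a", "b"}
R = {("a", "b"), ("b", "a")}
classes, qR = quotient(objects, R)

print(is_irreflexive(objects, R))    # irreflexivity in the original structure
print(is_irreflexive(classes, qR))   # irreflexivity in the quotient
```

Here the two objects collapse into a single class that bears [R] to itself,
so a sentence true of the original structure is false of its quotient.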

The upshot: whenever we have a structure that admits nontrivial
symmetries, we can factor these out, constructing a quotient structure
closely related to the original one. In special cases—such as when all
of the relations of the structures are one-place—every theory true of
the original structure is also true of the quotient.11

But in general the two structures will be described by related but
distinct theories. Hacking argues that the committed advocate of PII
can brush off one worry aroused by this divergence: that theories
describing the quotient world may be complex compared to those
describing the original structure.

There is, however, a second worry in the neighborhood: theories of
the quotient may be too simple in comparison to those of the original
world. The following cases were suggested to me by Kit Fine. Consider
two structures for a given countable set of objects: in one structure
the set of objects is given an ordering isomorphic to the integers, in
the other an order isomorphic to the rationals. The quotient of each
structure is just a single object, related to itself. Here we have
somewhat rich structures that collapse to a disappointingly simple
one when we pass to the quotients. How sure can we be that taking
the quotient of a 'world' will yield a possible world?

Again, the committed Leibnizean need not be troubled: "Things
which are uniform, containing no variety, are always mere
abstractions."12 But the more moderate among us will probably conclude


10 For a deeper result that goes in the opposite direction, see Hodges, Shorter
Model Theory, Theorem 8.3.3(b).

11 This special case assumes premier importance for those who, following
Bertrand Russell's A Critical Exposition of the Philosophy of Leibniz (New York:
Routledge, 1992), have Leibniz maintaining that all facts can be reduced to
properties of monads. For a recent defense of this line, see J. A. Cover and
John O'Leary-Hawthorne, Substance and Individuation in Leibniz (New York:
Cambridge, 1999), chapter 2.

12 New Essays on Human Understanding (New York: Cambridge, 1996), §II.i.2.
that both of these worries render it necessary to examine the relation
between a structure and its quotient on a case-by-case basis in order
to discern whether the two can indeed be taken to correspond to the
same sorts of complexes of possibilities.


III. HACKING'S CONCLUSION

Hacking sets out to show that PII can be defended against a certain
sort of counterexample: presentations of symmetric worlds. He
proposes that any description of a symmetric world, in which multiple
objects play the same role in the pattern of properties and relations
instantiated, can always be taken to be a misdescription of an
asymmetric world that respects the principle. That this strategy is
available—even if there is room to doubt its prudence in many cases—
shows that PII can be protected from counterexample by
spatiotemporal symmetry. If we indulge in a little rational
reconstruction, and take PII to amount to the denial of the possibility of
symmetric worlds, then we reach Hacking's conclusion—"the pursuit
of logical questions might settle the issue of [PII], but mere
reflection on spatiotemporal examples is never enough" (op. cit., p. 249).13

Hacking14 himself has proposed a conception of the nature of logic
which does imply a version of PII. At the same time, it fails to
underwrite the catechetical principle that identity implies necessary
identity.15 In defending this lapse of orthodoxy, Hacking registers his
doubts about other views concerning rigid designation which form
the very basis of the faith. He rejects the claim that in naming two
actual qualitatively identical drops of water Jake and Jesse we gain the
ability to describe two distinct possible worlds Beta and Gam, the first
inhabited only by Jake, the second only by Jesse:


by cold naked stipulation we produce not two possibilities but one.
Beta and Gam, for all our naming, are the same world, populated by one
drop, with just so many properties. I am not here arguing a case for or
against "trans-world identification." I am arguing that there is no sense
in postulating two distinct worlds which have no point of distinction
except in the sheer names of their inhabitants (ibid., p. 626).


13 Leibniz, however, seems to have had a much broader conception of the
content of PII: at L III 5 he appears to use it to rule out the existence of
pairs of qualitatively identical worlds; it also appears to forbid the existence
of perfectly similar leaves, drops of water, or eggs in our own asymmetric world
(for example, at L IV 4 and L V 2; for further references and discussion, see
§VII.3 of Benson Mates, The Philosophy of Leibniz: Metaphysics and Language
(New York: Oxford, 1986)).

14 "On the Reality of Existence and Identity," Canadian Journal of Philosophy, VIII
(1978): 613-32, here pp. 618-19. See also his "What Is Logic?," this JOURNAL,
LXXVI, 6 (June 1979): 285-319.

15 "On the Reality of Existence and Identity," pp. 620-21.
As Hacking goes on to point out, "this is not an argument about the
identity of indiscernibles," as he construes that principle.16

But the question of the existence of such pairs of qualitatively
identical worlds is intimately involved with PSR, as Leibniz deployed
it in his dispute with Clarke.


IV. LEIBNIZ AND CLARKE
Early in his correspondence with Clarke, Leibniz asserts that the
reversal of the decay of natural religion in England can never be
accomplished by means of that country's native, mathematical,
principles alone—metaphysical principles are required. The principle of
contradiction suffices to found arithmetic and geometry, but "in
order to proceed from mathematics to natural philosophy, another
principle is requisite, as I have observed in my Theodicy: I mean, the
Principle of Sufficient Reason, viz. that nothing happens without a
reason why it should be so, rather than otherwise" (L II 1).
Archimedes's17 postulate that "Equal weights at equal distances are
in equilibrium, and equal weights at unequal distances are not in
equilibrium but incline towards the weight which is at the greater
distance" is a special case of this principle. Much of the controversy
between Leibniz and Clarke hinges on the question of the extent to
which God—unlike a balance or an ass—is capable of arbitrarily
choosing between qualitatively identical alternatives. In pressing this
question, Clarke raises the following difficulty:


why this particular system of matter, should be created in one
particular place, and that in another place, when, (all place being absolutely
indifferent to all matter,) it would have been exactly the same thing vice
versa, supposing the two systems (or the particles) of matter to be alike,
there can be no other reason, but the mere will of God. Which if it could
in no case act without a predetermining cause, any more than a balance
can move without a preponderating weight, this would tend to take away
all power of choosing, and to introduce fatality (C II 1).


Leibniz never answers this objection to Clarke's satisfaction any more
than Clarke ever demonstrates to Leibniz his comprehension of the
great principle under discussion.

But the particular example broached by Clarke in this passage
leads to one of the most interesting of the many proliferating side
issues under discussion. Leibniz jumps at the chance to air his
complaints about the Newtonian doctrine of absolute space. He opens


16 Again, compare L III 5.
17 The first postulate of On the Equilibrium of Planes in Thomas Heath, ed., The
Works of Archimedes (New York: Dover, 1953), p. 189.
with a bold bluff: "I have many demonstrations, to confute the fancy
of those who take space to be a substance, or at least an absolute
being. But I shall only use, at the present, one demonstration, which
the author here gives me occasion to insist upon" (L III 5). He goes
on to recapitulate Clarke's own argument: "'tis impossible there
should be a reason, why God, preserving the same situations of bodies
among themselves, should have placed them in space after one
certain particular manner, and not otherwise, why every thing was not
placed the quite contrary way, for instance by changing East into
West." We are now to read this argument, however, not as a refutation
of PSR by counterexample, but as a reductio of the supposition of
absolute space which proceeds via an appeal to that principle. This
clears the way for Leibniz's own conception of space:


But if space is nothing else, but that order or relation, and is nothing at
all without bodies, but the possibility of placing them, then those two
states, the one such as it is now, the other supposed to be the quite
contrary way, would not at all differ from one another. Their difference
therefore is only to be found in our chimerical supposition of the reality
of space in itself. But in truth the one would exactly be the same thing
as the other, they being absolutely indiscernible, and consequently there
is no room to enquire after a reason of the preference of the one to the
other (L III 5).


Where Clarke sees many possible arrangements of bits of matter
consistent with fixed specification of their relative distances, Leibniz
recognizes only one. We can think of Clarke as generating his space
of possibilities by considering some particular way of embedding a
material system into Euclidean space which gives rise to the specified
relative distances, then shifting, reflecting, and rotating this material
distribution in absolute space to generate many more, each satisfying
the same set of relations of relative distance. This is possible because
of the "indifference" of space to matter, which also guarantees that
each of the alternatives so generated will be qualitatively identical (in
one world, this object is here, in that one, there, but these two parts of
space share all of their qualitative properties and relations). Leibniz
protects PSR by identifying alternatives related by the symmetries of
Euclidean space. For him there is a single possibility here: the bodies,
in space, with such-and-such relations between them.


V. PROTECTING PSR
Notice that the symmetries at stake in the Leibniz-Clarke
correspondence are not symmetries of (typical) worlds—only very special
arrangements of matter are invariant under rotations, reflections, or
translations. Rather, the symmetries in question are symmetries of the
space of possibilities: consider the set of possible dispositions of some
material system in absolute space; act on each possibility by (say)
translating each particle by some given amount in some given
direction; the set of possibilities is invariant under this action (and under
the actions of reflections and rotations as well), although few of its
members are.

Part of the content of PSR involves the denial of the existence of
qualitatively identical worlds.18 In this guise the principle appears to
be vulnerable to symmetry-driven counterexamples. By analogy with
Hacking's defense of his construal of PII, we see, however, that this
part of the content of PSR can be protected from this class of putative
counterexamples if we insist that symmetries should be factored out
of the space of possible worlds.

Note that the apparatus of section II can be deployed whenever we
have a set structured by some relations. In Hacking's defense of PII,
the structured sets in question were possible worlds, whose objects
corresponded to material objects, endowed with properties and
standing in geometric relations to one another. Hacking advises us to
factor out symmetries from such sets to ensure that they do not
include objects sharing all of their qualitative properties and
relations. In order to defend PSR against symmetry objections, we need
to turn our attention to structured sets whose objects are possible
worlds, factoring out symmetries from such sets to ensure that there
are no distinct but qualitatively identical worlds.

Mathematical treatments of classical mechanics provide one of the
few settings in which structured spaces of possible worlds arise in a
natural way—and also constitute the natural context for the questions
raised by Leibniz and Clarke. So let us focus on that case. We are
given a phase space whose points represent physically possible
instantaneous dynamical states of our system—initial data. Alternatively,
since the physics is deterministic, we know that the space of physically
possible histories must be isomorphic to the space of initial data
(choosing a time at which to pose the initial data determines an
isomorphism). So we can, as it suits us, take the points of a phase
space to correspond to either possible worlds or to temporal slices of
possible worlds.


18 Leibniz has uses for the principles that do not fit this mold. Archimedes's
principle mentioned above is one. Another, very prominent, one is as a principle
of maximization: see the passage (from a letter to Princess Caroline) which is
customarily printed as a postscript to L IV, or the letter to Des Bosses of
August 19, 1715 (Leroy Loemker, ed., Gottfried Wilhelm Leibniz: Philosophical
Papers and Letters (Boston: Reidel, 1970), p. 613).
A classical mechanical phase space carries a sort of geometric
structure that wants only the specification of a distinguished function
on phase space, the Hamiltonian, to determine a unique curve
through each of its points.19 If we think of points of phase space as
representing instantaneous states, then each dynamical trajectory
describes a physically possible history of states.20 In the most common
case, the phase space parameterizes the set of possible positions and
momenta of some set of particles or fields in Euclidean space. Such
a phase space carries a canonical geometric structure, and the
standard dynamics arise when the Hamiltonian is taken to be the function
which assigns to each state its total energy. Typically our
Hamiltonians are written as sums of kinetic energy terms (encoding inertial
effects) and potential energy terms (encoding information about the
action of forces).
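
For concreteness, the familiar textbook form of such a Hamiltonian, and of
the curves it determines, can be written out as follows. The symbols q_i,
p_i, m_i, and V are notation assumed here for the positions, momenta,
masses, and potential energy; they do not appear in the text above.

```latex
H(q,p) \;=\; \sum_{i=1}^{N} \frac{\lvert p_i \rvert^{2}}{2 m_i} \;+\; V(q_1,\dots,q_N),
\qquad
\dot q_i \;=\; \frac{\partial H}{\partial p_i},
\qquad
\dot p_i \;=\; -\,\frac{\partial H}{\partial q_i}.
```

The first term is the kinetic energy and the second the potential energy;
the two differential equations pick out the unique curve through each point
of phase space.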

Now let us suppose, as is in fact quite typical, that we have a classical
mechanical theory which is invariant under some symmetries—so
that there are bijections from the phase space to itself which leave
invariant the geometric structure and Hamiltonian of our theory.
The set of trajectories through two points of phase space related by
such a symmetry will have the same structure (since the dynamics are
determined by the geometric structure and the Hamiltonian, which
are themselves invariant under the symmetry in question). And this is
to say that at the level of description we are working at—where all and
only dynamical variables are included in our descriptions of possible
states and worlds—these two points represent qualitatively
indistinguishable possible states or worlds. So the existence of symmetries
of our dynamical theory poses a threat to PSR.

The advocate of PSR will observe that this threat can be deflected
if we hold that it is not this original phase space but its quotient
structure—the space of possibilities which arises when points of the
original phase space related by symmetries are identified—which
parameterizes the space of dynamically possible worlds.

What is the price of this defense of PSR? We started with an elegant
mathematical formulation of a physical theory. In moving to a theory
of equivalence classes of our original states, it is natural to worry that
we will end up with something hopelessly ad hoc, perhaps with little


19 What is needed is a means—typically provided by a symplectic form, a closed,
nondegenerate two-form—for establishing a correspondence between vector fields
and differential forms; one then solves for the vector field corresponding to dH,
where H is the Hamiltonian, and integrates to find the dynamical trajectories.

20 If we think of points of phase space as corresponding to possible worlds, then
two worlds lie on the same dynamical trajectory if and only if they contain the
same sequence of states "shifted in absolute time."
claim to be considered a physical theory in its own right. In the next
few sections, I shall make some remarks about theorems in geometric
mechanics which ought to allay this fear.21


VI. EXAMPLE: CELESTIAL MECHANICS

Let us apply this strategy to a familiar example—the classical theory
of N gravitating point particles. It is helpful to have two spaces in view
here: the 3N-dimensional configuration space of the system (encoding
the possible instantaneous dispositions of the N particles in
three-dimensional Euclidean space), and the corresponding 6N-dimensional
phase space (parameterizing the possible instantaneous
positions and velocities of the particles).22 The phase space carries a
canonical geometric structure, and the Newtonian dynamics are
determined by taking as our Hamiltonian the sum of the kinetic energy
and the gravitational potential energy.

If we consider some isometry of Euclidean space—a product of
translations, rotations, and reflections—then we can see how it maps
each point of the configuration space of the theory to some other:
allowing the isometry to act on each of the positions of the individual
particles leads to a new configuration of the system. Of course, since
isometries preserve distance, the initial configuration and the
transformed configuration instantiate the same pattern of relative distance
relations between particles.

This action of the Euclidean group on the configuration space
induces an action of the group on the corresponding phase space. If
we know how a given Euclidean isometry acts on each point along a
curve in the configuration space, then we know how it acts on the
curve itself, and this allows us to define an action of the group on
tangents to curves in configuration space, which is just to say that we
know how each isometry transforms the velocities of each of our
particles. The action is, in fact, straightforward: a translation does not
alter a velocity vector; a rotation or reflection acts on such a vector in
the obvious way (rotating or reflecting it).

Now the important point for us is that the group of Euclidean
isometries is, in fact, a symmetry group for our theory, leaving
invariant the Hamiltonian and the geometric structure of the phase


21 The best short introduction to the body of work discussed below is Jerrold
Marsden, "Steve Smale and Geometric Mechanics," in M. Hirsch, Marsden, and M.
Shub, eds., From Topology to Computation: Proceedings of the Smalefest (New York:
Springer, 1993), pp. 499-516. For more comprehensive treatments, see Marsden's
Lectures on Mechanics (New York: Cambridge, 1992), or Victor Guillemin and Shlomo
Sternberg, Symplectic Techniques in Physics (New York: Cambridge, 1984).

22 It is harmless here and below to speak of 'velocities' where 'momenta' would
be more proper.
space—and, therefore, mapping dynamical trajectories to dynamical
trajectories.23
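
The invariance of the Hamiltonian can be spot-checked numerically. The
sketch below is a minimal illustration under stated assumptions: it uses
velocities rather than momenta, sets the gravitational constant to 1 in
arbitrary units, and tests only one rotation (about the z-axis) followed by
one translation, leaving reflections aside. The function names and the
sample data are hypothetical.

```python
import math

G = 1.0  # gravitational constant in arbitrary units (an assumption)


def hamiltonian(masses, positions, velocities):
    """Kinetic energy plus pairwise gravitational potential energy."""
    kinetic = sum(0.5 * m * sum(c * c for c in v)
                  for m, v in zip(masses, velocities))
    potential = 0.0
    for i in range(len(masses)):
        for j in range(i + 1, len(masses)):
            separation = math.dist(positions[i], positions[j])
            potential -= G * masses[i] * masses[j] / separation
    return kinetic + potential


def rotate_z(vec, angle):
    """Rotate a 3-vector about the z-axis."""
    x, y, z = vec
    c, s = math.cos(angle), math.sin(angle)
    return (c * x - s * y, s * x + c * y, z)


def act(positions, velocities, angle, shift):
    """Rotate then translate positions; velocities are rotated but not shifted."""
    new_pos = [tuple(r + a for r, a in zip(rotate_z(q, angle), shift))
               for q in positions]
    new_vel = [rotate_z(v, angle) for v in velocities]
    return new_pos, new_vel


masses = [1.0, 2.0, 3.0]
positions = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 1.0)]
velocities = [(0.1, 0.0, 0.0), (0.0, -0.2, 0.3), (0.0, 0.1, 0.0)]

new_positions, new_velocities = act(positions, velocities, 0.7, (5.0, -3.0, 2.0))
h_before = hamiltonian(masses, positions, velocities)
h_after = hamiltonian(masses, new_positions, new_velocities)
print(math.isclose(h_before, h_after, rel_tol=1e-9))
```

The transformed state is a different point of phase space, yet it is
assigned the same energy, because the kinetic term depends only on the
lengths of the velocity vectors and the potential term only on the relative
distances.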

The relationalism about space manifested in Leibniz's replies to
Clarke suggests the following line of thought.24 The standard
configuration space overcounts possible configurations of particles by
recognizing many possible ways to instantiate each geometrically
possible pattern of relative distances. We ought to replace this
configuration space by the relative configuration space—parameterized by
the geometrically possible patterns of relative distances between the
particles—which arises when we identify points in the standard
configuration space related by the action of the Euclidean group. We
ought then to formulate a strictly relational dynamics, set in the relative
phase space (parameterized by the possible instantaneous relative
distances and velocities for the particles).

It is easy to see, however, that this relative phase space is not rich
enough to capture the empirical content of Newton's theory: if I
merely specify the initial relative distances and velocities of two (or
more) particles, I have not given enough information to fix the future
evolution of their relative distances and velocities—depending on
the total angular momentum of the system, the particles may
eventually fall toward one another, or they may form a stable or
expanding orbit.
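
The two-particle case makes the point concrete. In the sketch below, the
relational data are the separation r and its rate of change; the three
sample states all share r = 1 and a vanishing rate of change, and differ
only in the tangential part of the relative velocity, which the relational
data do not record. The formula is the standard radial equation for the
two-body problem; the function name, the units (G = 1), and the sample
values are assumptions made for illustration.

```python
def separation_acceleration(r, v_tangential, total_mass, G=1.0):
    """Second time derivative of the separation of two gravitating particles.

    r            -- current separation
    v_tangential -- component of the relative velocity perpendicular to the
                    line joining the particles (invisible to relational data)
    total_mass   -- sum of the two masses
    """
    centrifugal = v_tangential ** 2 / r
    attraction = G * total_mass / r ** 2
    return centrifugal - attraction


# Same relational data (r = 1, dr/dt = 0), three different states of rotation.
falling = separation_acceleration(1.0, 0.0, 1.0)    # no rotation
circular = separation_acceleration(1.0, 1.0, 1.0)   # circular orbit
receding = separation_acceleration(1.0, 2.0, 1.0)   # faster rotation

print(falling, circular, receding)
```

The sign of the result differs across the three cases, so the subsequent
evolution of the separation is not fixed by the initial separation and its
rate of change alone.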

So a strictly relational reformulation of Newton's theory is not to be
had. What bearing does this fact have on our general strategy for
protecting PSR? Let us count dimensions. The standard theory
has a 3N-dimensional configuration space and a 6N-dimensional
phase space, and is invariant under the six-dimensional group of
Euclidean symmetries. It follows that the relative configuration space
is 3N-6 dimensional and the relative phase space (which includes the
relative configuration variables and their rates of change) is 6N-12
dimensional.

We can also construct a reduced phase space, by taking the quotient
of the standard phase space by the action of the Euclidean group.


23 The canonical (cotangent bundle) symplectic structure is invariant under any
transformation which is the lift of a diffeomorphism of the configuration space;
the kinetic energy is Euclidean, deriving from the Euclidean structure of the
space in which the particles move; the gravitational potential energy sees only
relative distance between particles.

24 This program can also be motivated by relationalism about motion; see my
"Geometry and Motion," forthcoming in The British Journal for the Philosophy of
Science. Leibniz sometimes affirms the relativity of all motion; in the same
texts, however, he also recognizes a notion of "absolute true motion," to be
founded upon a metaphysical notion of force. Compare L V 52 and L V 53; see also
Part II of "Specimen Dynamicum" in Loemker, ed., pp. 444-50.
The points of this space are equivalence classes of points of the
standard phase space related by isometries, carrying a geometric
structure inherited from that of the standard phase space.25 Since the
standard Hamiltonian is invariant under the group action, it projects
down to a well-defined Hamiltonian on the reduced phase space. The
resulting dynamical theory captures all of the invariant content of the
standard theory.

The reduced phase space is 6N-6 dimensional, so this space is
intermediate in size between the standard phase space and the
relative phase space. In fact, we can think of it as being the result of
augmenting the relative distances and relative velocities which
parameterize the relative phase space with six further variables. These
fall into two classes: constants of motion and true dynamical variables.
Four of the six are constants of motion: if we fix their initial values,
then we are guaranteed that dynamical evolution will leave them
fixed. The other two variables are dynamical—like the relative
position and velocity variables, we expect dynamical evolution to alter
their values.

Three of the four constants of motion are redundant in the
following sense. We can stipulate that they take some fixed value and
examine the dynamical theory that results. It will consist of a space of
states, parameterized by the relational variables, the remaining
constant of motion, and the two nonrelational dynamical variables. This
space will carry a geometric structure and a Hamiltonian. Neither
depends in any way on which value we choose for our three constants
of motion. This is just to say that these constants of motion play no
dynamical role in our theory—and hence that, from the point of view
we are considering, they ought to be dropped from the reduced
phase space, yielding a space of dimension 6N-9. (These three
dynamically inert variables describe the velocity of the center of mass
of the system; they can be eliminated by identifying points of the
original or reduced phase space related by Galilean boosts.)
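
The dimension counts used in this section can be tabulated in a few lines.
The following is a bookkeeping sketch only: the function name and the
dictionary keys are hypothetical labels, and the counts are assumed to hold
for N of at least 3, where a generic configuration is not left fixed by any
continuous family of isometries.

```python
def dimensions(n_particles):
    """Dimensions of the spaces discussed for N gravitating point particles."""
    n = n_particles
    return {
        "configuration space": 3 * n,
        "phase space": 6 * n,
        "relative configuration space": 3 * n - 6,
        "relative phase space": 6 * n - 12,
        "reduced phase space": 6 * n - 6,
        "reduced, center-of-mass velocity dropped": 6 * n - 9,
    }


d = dimensions(3)
# Number of nonrelational variables added to the relative phase space.
extra = d["reduced phase space"] - d["relative phase space"]
print(d)
print(extra)
```

For every N the difference between the reduced and the relative phase
space is six, and three variables remain once the center-of-mass velocity
is dropped.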

The remaining nonrelational variables of the reduced phase space
are very different. I observed above that merely specifying the relative
distances and velocities for our particles is insufficient to determine
their future evolution—one also has to know something about the
state of rotation of the system that they form. The remaining constant
of motion, R, corresponds to the magnitude of the angular momentum
of the system, while the two nonrelational dynamical variables
serve to fix a point on a sphere of radius R, encoding the direction of
the angular momentum of the system in a frame rotating along with
it. If we fix R, we get a space of variables consisting of the relational
variables and our two nonrelational ones. The geometric structure
and Hamiltonian on this space do depend on the value of R.26


25 The result is a stratified space, equipped with a Poisson structure adequate for
the formulation of classical mechanics. See, for example, Juan-Pablo Ortega and
Tudor Ratiu, "Singular Reduction of Poisson Manifolds," Letters in Mathematical
Physics, XLVI (1998): 359-72.

These three additional nonrelational variables are required to
handle the difference between rotating and nonrotating systems of
particles—a difference that simply cannot be captured in the strictly
relational framework of the relative phase space, which lacks the
resources to draw the dynamically crucial distinction between the
various possible states of rotation of the system. The reduced theory
incorporates just enough variables to be able to recognize such
differences—without running afoul of PSR by countenancing distinct
physical possibilities that are related to one another by symmetries.
The nine variables included in the standard theory but absent from
the reduced theory—the position and velocity of the center of mass,
and the orientation of the system about its center of mass—are closely
associated with the symmetries of classical mechanics. They also play
no dynamical role, and it is their elimination which allows one to
protect PSR.

Let me point out some other features of the reduced theory. First,
notice that although from the perspective of the standard theory the
three remaining nonrelational variables bear a very close relationship
to the total angular momentum of the system, we find that from the
perspective of the reduced phase space they have no straightforward
geometric interpretation. Indeed, notice that we never fixed the
value of N. Thus, we need the same three nonrelational variables in
our reduced phase space no matter how large our system is. This
forces us to conclude that these three variables correspond to
properties of the system as a whole, rather than to properties of any
particular particle. Within the present framework—particles moving
in Euclidean space—it is very difficult to see how we can give a
geometrically reasonable interpretation of these nonrelational
variables.27

Furthermore, notice that, if R = 0—if, from the traditional point of
view, the angular momentum of the system vanishes—then the two
remaining nonrelational dynamical variables drop out of the picture

26 For details, see Toshihiro Iwai, "A Geometric Setting for Classical Molecular
Dynamics," Annales de l'Institut Henri Poincaré, Section A, 47 (1987): 199-219.

27 Neo-Newtonian space-time provides a more appropriate setting for
interpreting the reduced theory; see my "Geometry and Motion."
(since they parameterize a sphere of radius zero). Thus, we end up
with a theory on the relative phase space whose variables are just the
relative distances and velocities. And, in fact, the dynamics of this
sector of the reduced phase space coincide with the dynamics for the
relative distances and velocities which result if the relational initial
data are arbitrarily embedded in the standard phase space, subject to
the constraint that the angular momentum vanishes, and their
evolution is read off from the evolution of the standard variables. Since,
as a matter of empirical fact, our universe has no discernible angular
momentum, this strictly relational sector of the reduced theory is
empirically adequate within the domain of Newton's theory.28


VII. THE GENERAL CASE
The example of the preceding section is typical of one large class of
cases. We begin with a configuration space for some set of particles or
fields, and with a group of symmetries acting on that space; in this
case, we expect that a reasonable theory constructed on the phase
space of the system will admit that same group of symmetries. When
this occurs, we can—so long as some relatively weak technical
conditions hold—construct an alternative theory on the reduced phase
space which arises when we identify points of the original phase space
related by symmetries. We then find the following. (1) The reduced
phase space includes variables in addition to those which arise when
we begin by quotienting the original configuration space by its group
of symmetries, and then augment the resulting set of relative
configuration variables with their rates of change. These further
nonrelational variables of the reduced theory fall into two groups—constants
of motion and genuine dynamical variables.29 (2) These
nonrelational variables are closely related to the conserved quantities that
Noether's theorem associates with the symmetry group of the theory.
The number of nonrelational variables will be equal to the
dimensionality of the group of symmetries. In particular, it will be
independent of the size of the system under consideration—suggesting that
these additional variables ought to be thought of as properties of the
system as a whole. These variables are often implicated in awkward
problems of interpretation. (3) Typically, the topology, geometry,
and Hamiltonian of the reduced theory will be distinctly more
complex than those of the original theory. Almost always, the reduced
Hamiltonian will include potential energy terms corresponding to
forces absent in the original theory. (4) In the special case
corresponding to the vanishing of the conserved quantities associated with
our symmetries, we end up with a theory depending only upon the
relative configuration and momentum variables. When this fragment
is empirically adequate, we are able to achieve a fully relational
theory.


28 This sector very nearly coincides with the theory of Julian Barbour and Bruno
Bertotti, "Mach's Principle and the Structure of Dynamical Theories," Proceedings of
the Royal Society of London A, 382 (1981): 295-306; see my "Geometry and Motion" for
references and further discussion.

29 In the special case where the group in question is Abelian, all of the
nonrelational variables of the reduced phase space are constants of motion.

This gives us some idea of the costs and benefits of saving PSR from
symmetries within one large and important family of classical
mechanical theories (similar remarks would apply to more general
classes of theories). It turns out that devoted advocates of PSR can
rest easy—we can (almost) always replace a theory admitting a
symmetry group by a mathematically reasonable quotient. Others, in
light of (3) above, will want to examine carefully the sacrifices
required on a case-by-case basis—the reduced theory may appear less
than attractive qua physics beside its more familiar symmetric
counterpart. But one has to be on one's guard here—for judgments about
what is natural and what is contrived are strongly conditioned by
upbringing and familiarity.

It is only relatively recently that the reduction procedure we have been
discussing was formulated in its full generality. But it has roots in
the nineteenth century. Heinrich Hertz was very impressed by the
possibility (analyzed in detail by Edward Routh) of eliminating those
cyclic variables whose momenta are irrelevant to a given dynamical
problem.30 Hertz hoped that the terms which this process introduces into
the reduced Hamiltonian would provide one of the keys to eliminating the
notion of force altogether from classical mechanics: he believed that
the forces which appear in our theories of macroscopic phenomena are in
part an artifact of our lack of knowledge of the true microscopic
degrees of freedom.31 We have, he thought, effectively performed a
reduction without realizing it: what we think of as being the proper
form for a physical theory is in fact highly misleading, the result of
an ontologically inappropriate reduction.


30 For an elementary discussion, see Cornelius Lanczos, The Variational
Principles of Mechanics (New York: Dover, 1986), §V.4. The standard
textbook example is the Kepler problem (the motion of a point particle
in a central force field), where the elimination of an angular variable
results in a theory with a force which varies inversely with r^2; see
V. I. Arnold, Mathematical Methods of Classical Mechanics (New York:
Springer, 1989), §2.8.

31 The Principles of Mechanics (New York: Dover, 1956). For a helpful
sketch of Hertz's program, see Lanczos, Variational Principles, §V.5.
(An aside about Hertz's program. The idea was to start with a flat
configuration space, then to reconstitute physics as we know it through
the imposition of constraints on phase space and the elimination of
cyclic variables. All of physics would then follow from (Euclidean)
kinetic energy and rigid connections between particles; force would play
no role. The program is sometimes dismissed by philosophers as hopeless:
What mechanism of hidden microscopic degrees of freedom could account
for the gravitational force? But, in fact, it is all too easy to
implement technically.32 The problem is rather that known
implementations have little discernible physical interest. The kinetic
energy in question bears no straightforward relation to any physical
notion.33 Even worse: for almost any physical system, it suffices to add
a single hidden cyclic degree of freedom.)

In a similar vein, our theory of the behavior of a charged particle in
an electromagnetic field (which has the particle tracing out a curved
trajectory in four-dimensional space-time in accord with the Lorentz
force law) strikes us as being entirely natural. But this theory can be
thought of as arising via reduction from a Kaluza-Klein theory, in which
space-time has a fifth (minuscule and circular) dimension and the
charged particle moves along a straight line.34 Here the question of
which is more natural or attractive is a difficult one, and we might
well expect our intuitions to be hostage to the success of
Kaluza-Klein-inspired strategies in string theory and elsewhere.

My point is this: while it is quite true that we have no guide here
other than the sum-total of our judgments of simplicity, elegance, and
so on, we must be willing to interrogate this guide rather carefully,
and with as much self-consciousness as we can muster.


VIII. METHODOLOGICAL WORRIES 
Very often, then, when presented with a physical theory invariant
under a group of symmetries, we have the option of replacing it by a


32 Almost any finite-dimensional classical theory can be handled using
some sequence of standard tricks and results. Handy ingredients include:
Theorem 3.15 of Arnold, V. Kozlov, and A. Neishtadt, Mathematical
Aspects of Classical and Celestial Mechanics (New York: Springer, 1997);
Theorem 2 of Mark Gotay and Gijs Tuynman, "R^{2n} is a Universal
Symplectic Manifold for Reduction," Letters in Mathematical Physics,
XVIII (1989): 55-59; the trick deployed on p. 43 of Guillemin and
Sternberg, Variations on a Theme by Kepler (Providence, RI: American
Mathematical Society, 1990); the embedding theorems of Mikhael Gromov,
Partial Differential Relations (New York: Springer, 1986), §§3.1 and
3.3.

33 I am here indebted to Howard Stein's criticism of a related program
of Henri Poincaré in his unpublished paper "Physics and Philosophy Meet:
The Strange Case of Poincaré."

34 In this case, charge appears as the momentum of the particle along
the fifth dimension; see, for example, Marsden, Lectures on Mechanics,
§§7.5 and 7.6.




reduced theory with the same power. Ought we to do so? It is often
asserted that symmetries are crucial in modern physics: they underlie
conservation laws, they enforce relativity principles, they guide theory
construction. Can we afford to sacrifice these virtues?

There is much to be said here. But the short answer is that no real
sacrifices are required. Conservation laws are enforced in the reduced
theory, just as in the original theory; indeed, they are encoded in the
geometry of the reduced phase space in a rather interesting way. As for
relativity principles and other heuristics, note that there is a sense
in which the symmetry group is still present, in a more subtle form: by
looking at the local geometry of the space of states, one can divine
which group, if any, was factored out in its construction.35 So nothing
need be lost in moving to the reduced theory.

But there is something to be gained. Invariance of a theory under
(continuous groups of) symmetries implies that the phase space of the
theory includes dynamically inert variables (since the action of the
group on a point of phase space carries us to a point where the dynamics
look exactly the same). By moving to the reduced phase space, we avoid
the sort of redundancy in our space of possibilities which underwrote
Clarke's attack on PSR, without losing any of the (invariant) content of
our dynamics.


IX. PSR IMPLIES PII?

I want to discuss briefly one final topic: the logical relationship,
within the present context, between my version of PSR and Hacking's
version of PII. I mentioned above that the construction of a reduced
phase space is contingent upon some technical conditions. One that is
often imposed requires that the identity be the only symmetry that has
any fixed points. This guarantees that the reduced phase space will be a
manifold. More generally, this will follow so long as all of the points
of our original phase space manifest the same degree of symmetry.36 But
when this fails, the reduced phase space will not be a manifold: generic
points of the original space, corresponding to minimally symmetric
arrangements of matter, will project down to generic, well-behaved
points of the reduced phase space, but the reduced phase space will also
have singularities that correspond to those points of the original space
which represent more symmetrical arrangements of matter.


35 This can be made precise; see Alan Weinstein, "The Local Structure of
Poisson Manifolds," Journal of Differential Geometry, XVIII (1983):
523-57.

36 That is, we require that the isotropy subgroups associated with each
point of our phase space be conjugate to one another within the symmetry
group.
Now, this seems to mean that enforcing PSR (in my sense) implies PII (in
Hacking's sense), since if one wants a nonsingular reduced phase space,
one must exclude symmetric arrangements of matter from one's original
space of physical possibilities. And this seems meet and right, since
Leibniz often proclaimed that his entire philosophy, including lesser
principles such as PII, rested ultimately upon the Principles of
Noncontradiction and Sufficient Reason.37

Alas, the technical situation is more complex than this. It is not so
much that the reduced theory makes no sense near singularities (the
reduced phase space is a well-behaved topological space, carrying enough
geometric structure to underwrite dynamics) as that one must be more
careful in formulating it, since the structure of the tangent space of
the reduced phase space changes discontinuously as one moves from
generic points to singular points.38 But perhaps one could still cobble
together a Leibnizean argument from PSR to PII via the Law of Continuity
(which might be thought to prohibit such jumps)?39 Or perhaps this last
anachronistic epicycle is one too many.

GORDON BELOT 
New York University


37 See, for example, LII1 and LV21. On this issue, see Cover and
O'Leary-Hawthorne, chapter 5.

38 This picture has been established for finite dimensional systems (in
addition to Ortega and Ratiu, "Singular Reduction," see Reyer Sjamaar
and Eugene Lerman, "Stratified Symplectic Spaces and Reduction," Annals
of Mathematics, CXXXIV (1991): 375-422), and for some important infinite
dimensional systems (James Isenberg and Marsden, "A Slice Theorem for
the Space of Solutions of Einstein's Equations," Physics Reports, LXXXIX
(1982): 179-222; Jedrzej Sniatycki, Günter Schwartz, and Larry Bates,
"Yang-Mills and Dirac Fields in a Bag, Constraints and Reduction,"
Communications in Mathematical Physics, CLXXVI (1996): 95-115).

39 Leibniz deployed this Law in rejecting laws of nature proposed by
Descartes and Malebranche; see Loemker, ed., pp. 351-53 and 397-403.
CAUSES AS PROBABILITY RAISERS OF PROCESSES*


Causation, according to David Hume, is one of the three fundamental
conceptual relations (along with resemblance and contiguity), and is the
foundation of all reasoning concerning matters of fact. Causation,
according to various contemporary philosophers, is required for the
analysis of metaphysical concepts such as persistence, scientific
concepts such as explanation and disposition, epistemic concepts such as
perception and warrant, ethical concepts such as action and
responsibility, legal concepts such as homicide and negligence, mental
concepts such as functional role and conceptual content, and linguistic
concepts such as reference, to name just a salient few. Yet the nature
of the causal relation itself remains mysterious.

The leading accounts of the nature of causation divide into
probability-raising and process-linkage views. On the probability-raising
view, causation is rooted in the comparative probability of the effect
with the cause versus without. On the process-linkage view, causation is
rooted in the existence of a connecting line from cause to effect. I
propose a third alternative which synthesizes these views while solving
their problem cases. On this alternative, causation is rooted in the
comparative probability of the connecting line to the effect with the
cause versus without: causes as probability raisers of processes.

In the first two sections, I (briefly) outline the probability-raising
and process-linkage views and identify a space of problems, and then I
focus on developing the alternative probability-raisers-of-processes
solution. I identify problems for this view as well, though I conclude
that it is at least in important respects a step forward.


I. PROBABILITY-RAISING AND PROCESS-LINKAGE VIEWS


The probability-raising view of causation has its roots in the social
sciences and in the philosophical analysis of causation in terms of
necessary and/or sufficient conditions. In the social sciences, issues
concerning causation are addressed by the use of algorithms that seek
covariations in frequency data, and then sort the causal covariations


* Thanks to David Coady, Cian Dorr, Hartry Field, Ned Hall, Chris
Hitchcock, Barry Loewer, Tim Maudlin, Michael McDermott, Brian
McLaughlin, Ram Neta, L. A. Paul, and especially to David Lewis.
from the merely correlational ones.1 Many social scientists have
operationally identified the causal relation with these algorithms. This
is a statistical conception of causation.

The probability-raising view also represents the indeterminism-induced
synthesis of philosophical accounts of causes in terms of necessary
and/or sufficient conditions for their effects. The natural extension of
the necessary condition view into the indeterministic case is that
causes are necessary, not for the presence of their effects (since the
effect might still retain some nonzero chance of spontaneous eruption
even without the cause), but for a higher probability of their effects.
Thus, for David Lewis,2 causation is rooted in the counterfactual claim
that, had the cause not occurred, then the chance of the effect would
have been less, and D. H. Mellor3 requires that Ch_C(E) > Ch_~C(E),
which measures the chance that the actual circumstances give E versus
the counterfactual chance that the circumstances minimally changed with
respect to C would give E. The natural extension of the sufficient
condition view into the indeterministic case is that causes are
sufficient, not for the presence of their effects (since the effect
might still retain some nonzero chance of fizzling even with the cause),
but for a higher probability of their effects. Thus, for Patrick
Suppes,4 causation is rooted in the statistical claim that the limiting
frequency of effect-type events is greater with cause-type events than
without, p(E|C) > p(E|~C), and Ellery Eells5 requires that the chance of
E increase to a high value just after the time of C and remain high
until the time of E. And so counterfactual dependence flows into
counterfactual chance dependence, constant conjunction flows into
comparatively frequent conjunction, and the two merge into the wider
seas of probability raising.
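
As a rough illustration of the Suppes-style statistical criterion, the comparison p(E|C) > p(E|~C) can be estimated from finite frequency data. The sketch below is only an approximation of the idea, since Suppes's claim concerns limiting frequencies; the function names and the toy observations are hypothetical and are not drawn from any of the works cited.

```python
from typing import Sequence, Tuple

# Each observation is (cause_type_event_present, effect_type_event_present).
Observation = Tuple[bool, bool]


def conditional_frequency(observations: Sequence[Observation],
                          cause_present: bool) -> float:
    """Relative frequency of the effect among trials with the given cause status."""
    effects = [effect for cause, effect in observations if cause == cause_present]
    if not effects:
        raise ValueError("no observations with the requested cause status")
    # True counts as 1 and False as 0 when summed.
    return sum(effects) / len(effects)


def raises_probability(observations: Sequence[Observation]) -> bool:
    """True when the estimated p(E|C) exceeds the estimated p(E|~C)."""
    return (conditional_frequency(observations, True)
            > conditional_frequency(observations, False))


# Hypothetical toy data: 10 trials with the cause, 10 without.
toy_data = ([(True, True)] * 8 + [(True, False)] * 2
            + [(False, True)] * 3 + [(False, False)] * 7)

print(raises_probability(toy_data))
```

A finite-sample comparison of this kind detects covariation only; sorting causal from merely correlational covariation is the further task that the algorithms mentioned above are meant to address.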

The process-linkage view of causation has its roots in physics and in
the philosophical analysis of causation in terms of physical connections
rather than abstract conditions. Physicists have long suggested that
flows of energy-momentum, or something of that sort, are the basis for
actual causal connections.6 Many philosophers have rejected


1 Peter Spirtes, Clark Glymour, and Richard Scheines offer a
sophisticated algorithmic approach in their Causation, Prediction, and
Search (New York: Springer, 1993).

2 "Causation," in Philosophical Papers, Volume II (New York: Oxford,
1986), pp. 172-213.

3 The Facts of Causation (New York: Routledge, 1995).

4 A Probabilistic Theory of Causality (Amsterdam: North Holland, 1970).

5 Probabilistic Causality (New York: Cambridge, 1991).

6 Wladyslaw Krajewski ("Energetic, Informational, and Triggering
Causes," Erkenntnis, XLVIII (1997): 193-202) traces energy flow accounts
of causation back to Robert Mayer (the discoverer of the law of
conservation of energy) in 1842.
the necessary and/or sufficient conditions approach to causation in
favor of physical connections. David Fair7 identifies actual causation
with energy flow, Wesley Salmon8 describes causal processes as those
connections capable of propagating their own structure (transmitting a
mark), and Douglas Ehring9 requires the persistence of a trope from
cause to effect. Think of the path of energy flow, the wave of
structural propagation, and the worldline of the persisting trope as
constituting a physical connecting line from cause to effect.

I propose to separate the questions of how the probability-raising and
process-linkage relations are best analyzed (which I have discussed at
length elsewhere), from the question (which I focus on here) of how the
causal relation should then be analyzed. In the remainder of this
section, I sketch my preferred analyses of probability raising and
process linkage, and in what follows I focus on the analysis of
causation.

Probability raising is best analyzed, I think, in terms of Lewis's
counterfactual chances, on which C is a probability raiser of E if and
only if (i) C and E are actual distinct events, (ii) ch(E)-at-t_C = p,
and (iii) ~C □→ ch(E)-at-t_~C < p. In saying that C and E are actual
distinct events, I mean that each is a (Property, Region) ordered pair
whose first member is actually instantiated at its second, with the C
and E properties logically independent and the C and E regions
spatiotemporally disjoint. In saying that ch(E)-at-t_C = p, I mean that
the propensity (objective single-case chance) of E in the circumstances
at the time of C is p. In saying that ~C □→ ch(E)-at-t_~C < p, I mean
that the counterfactual supposition of the nonoccurrence of C entails
that the propensity of E in those same circumstances with C deleted
would have been less than p.10

Process linkage is best analyzed, I submit, in terms of pairwise
nomic subsumptions: processes are lawful sequences.11 Not just any


7 "Causation and the Flow of Energy," Erkenntnis, XIV (1979): 219-50.

8 Scientific Explanation and the Causal Structure of the World
(Princeton: University Press, 1984). In his Causality and Explanation
(New York: Oxford, 1998), Salmon responds to a suggestion from Phil Dowe
(for whom causal processes are the worldlines of
conserved-quantity-bearing objects; see "Wesley Salmon's Process Theory
of Causality and the Conserved Quantity Theory," Philosophy of Science,
LIX (1992): 195-216), by moving to a conserved-quantity transmission
view of causal processes. This remains squarely within the
process-linkage camp.

9 Causation and Persistence (New York: Oxford, 1997).

10 For further discussion, see Lewis as well as Mellor.

11 David Armstrong proposes: "A law (an entity in the world) provides
the ground in nature for an inference. The cause is like the antecedent"
("The Open Door: Counterfactual versus Singularist Theories of
Causation," in H. Sankey, ed., Causation and Laws of Nature (Boston:
Kluwer, 1999), pp. 175-85, here pp. 183-84).
lawful sequence will do. The antecedent must be prior to the consequent,
and the lawful connection must not be revealed to be merely
correlational by a more fundamental law. More precisely, C is directly
process linked to E if and only if (i) C and E are actual distinct
events, (ii) there is a law L of the form (∀R)(∀R')((P, R) ⊃ (P', R'))
such that C is an instance of the antecedent and E of the consequent,
(iii) there is no law L' more fundamental than L such that actual prior
event A is an instance of the antecedent and C&E is an instance of the
consequent, and (iv) t_C < t_E. In speaking of laws as having the form
(∀R)(∀R')((P, R) ⊃ (P', R')), I take the quantifiers to range over
regions, and the laws to relate the properties of one region to those of
another. Thus events, understood as (P, R) pairs, can instantiate the
antecedents and consequents of laws. Process linkage in general is then
defined from the direct case: C is process linked to E if and only if
there is a chain of direct process links between C and E.12
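
The step from direct process linkage to process linkage in general can be pictured as reachability in a directed graph whose edges are the direct links. The following is a minimal sketch under that reading; the event labels and the `direct_links` mapping are hypothetical, and the sketch simply takes the direct links as given rather than checking clauses (i) through (iv).

```python
from collections import deque
from typing import Dict, Hashable, Iterable


def process_linked(direct_links: Dict[Hashable, Iterable[Hashable]],
                   c: Hashable, e: Hashable) -> bool:
    """True when a chain of one or more direct process links leads from c to e."""
    # Start from the events directly linked to c, so that at least one link is used.
    frontier = deque(direct_links.get(c, ()))
    seen = set(frontier)
    while frontier:
        event = frontier.popleft()
        if event == e:
            return True
        for successor in direct_links.get(event, ()):
            if successor not in seen:
                seen.add(successor)
                frontier.append(successor)
    return False


# Hypothetical direct links for a thrown brick (labels are illustrative only).
brick_links = {
    "throw": ["mid-flight"],
    "mid-flight": ["impact"],
    "impact": ["shattering"],
}

print(process_linked(brick_links, "throw", "shattering"))
```

Because each direct link runs from an earlier event to a later one, the chain relation inherits the temporal direction of its links: in the toy data the throw is linked to the shattering but not conversely.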

So when a brick is thrown through a window, on the lawful sequence
analysis of process, there is a sequence of events from the throwing of
the brick, through its intermediary trajectories, to the shattering of
the window, and the events in this sequence will be pairwise lawfully
related (and not merely covariants, and rightly temporized). The initial
conditions of the brick being here with a certain momentum at t_0 will
lawfully entail that the brick flies to there at t_1. And since energy
flows are a this-worldly instance of lawful evolution, the lawful
subsumption analysis both undergirds the idea of, and inherits the
plausibility of, Fair's empirical identification of processes with
energy flows.

I claim that however the probability-raising and process-linkage
relations are interpreted, causation cannot be understood by probability
raising or process linkage alone, but rather should be understood by a
synthesis of these relations on which causes are probability raisers of
processes. Since the discussion that follows is relatively independent
of any specific understanding of probability raising and process
linkage,13 for the sake of generality, I speak of 'probability raising'
and 'process linkage' without further interpretation (unless otherwise
indicated). The reader is invited, though not required, to understand
these notions as I do.


12 For more details, see my "Processes as Lawful Sequences"
(unpublished).

13 I do assume that (i) both the probability-raising and process-linkage
relations are asymmetric, (ii) the probability-raising relation is
circumstance sensitive, and (iii) the process-linkage relation extends
to worlds with different laws than ours (and so is not tied into any
contingent details of actual physics).
Supposing we have some notion of probability raising and some notion of
process linkage in hand, how (if at all) can causation be analyzed?


II. THE PROBLEM SPACE

Causation cannot be analyzed purely in terms of probability raising or
purely in terms of process linkage, however these notions are
interpreted and however they are refined. There are counterexamples to
these approaches which extend to even their most sophisticated
refinements. These counterexamples interlock to form a problem space
that no extant account of causation can navigate.

(1) Causation without probability raising (preemption, and its trumping
variant). Pam throws a brick through the window. Meanwhile, Bob (a more
reliable vandal) holds his throw on seeing Pam in action, though had Pam
not thrown, Bob would have.14


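[DIAGRAM 1: neuron diagram of the preemption case, with nodes for Pam,
Bob, and the window shattering. The connection strengths are not
recoverable from the source.]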


Preempting Pam's throw is obviously a cause of the window shattering.
But her throw is a probability lowerer of the shattering: since Bob is a
more reliable vandal, the window's chances would have been worse with
Bob in action. Thus probability raising is not necessary for
causation.15
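
The preemption case can be checked against clauses (ii) and (iii) of the counterfactual-chance analysis with a small calculation. This is only a sketch: the reliabilities 0.5 and 0.9 are hypothetical numbers chosen for illustration, and the helper name is invented here.

```python
def is_probability_raiser(chance_with_c: float, chance_without_c: float) -> bool:
    """C raises the chance of E when the counterfactual chance without C is lower."""
    return chance_without_c < chance_with_c


# Hypothetical reliabilities: Bob is the more reliable vandal.
PAM_RELIABILITY = 0.5
BOB_RELIABILITY = 0.9

# Actually, Pam throws and Bob holds, so the shattering's chance is Pam's.
chance_with_pam_throwing = PAM_RELIABILITY
# Had Pam not thrown, Bob would have, so the counterfactual chance is Bob's.
chance_without_pam_throwing = BOB_RELIABILITY

pam_raises = is_probability_raiser(chance_with_pam_throwing,
                                   chance_without_pam_throwing)
print(pam_raises)
```

On these assumed numbers Pam's throw fails the probability-raising test even though it is the cause of the shattering, which is the structure of the counterexample.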

Probability-raising theorists are well aware of preemption and have
responded with an extensive literature of refinements. These refinements
divide into two general strategies: factor intermediaries and/or require
precision. Intermediary-based approaches look to


14 Diagram conventions: filled circles doubly represent neurons that
fire and events that occur; unfilled circles doubly represent neurons
that do not fire and events that do not occur; arrows represent
stimulatory connections, dots inhibitory connections, and numbers
probabilistic connections of varying strengths. If two neurons are
connected by a nonprobabilistic stimulatory connection and the first
fires, the second will fire unless inhibited.

15 For further discussion of these cases, see Lewis; I. J. Good, "A
Causal Calculus I," British Journal for the Philosophy of Science, XI
(1961): 305-18; and Peter Menzies, "Probabilistic Causation and Causal
Processes: A Critique of Lewis," Philosophy of Science, LVI (1989):
642-63.
intermediary events or probability evolutions to try to find a chain of
causations from preemptor to effect, and precision-based approaches look
to the exact time and manner of the events involved to try to find
differences in the effect attributable to the preemptor.16

There are, however, variants of preemption (trumping) with no
intermediaries to factor or differences to be precise about, which
immediately undercut all extant probability-raising-based strategies
against preemption.

The silver-tongued sergeant and mumbling major both order 'Charge!' and
so the corporal decides to charge. The major's order is a cause of the
corporal's decision, as can be seen from the fact that ranking orders
trump: when orders conflict the corporal will follow the major, so when
they accidentally converge the same should hold. But the major's order
is a probability lowerer of the corporal's decision: the probability of
the corporal deciding to charge in those circumstances would have been
greater had the major left it to the silver-tongued sergeant to give the
order, since the major is more likely to be misheard.17

It is an empirically open question (concerning exactly how the mind
works) whether there are any relevant intermediaries between order and
decision, and/or any precisifiable differences in the decision
attributable to the major: certainly, the corporal need not be conscious
of either. And so it is possible to have preemption


16 Examples of intermediary-based approaches include Lewis's original
approach which takes the transitive closure of probability raising,
Menzies's requirement of temporally continuous chains of probability
raisings, Eells's use of probability trajectories through the (C, E)
temporal interval, and Igal Kvart's constructions based on further
conditionalization on temporally intermediate events. These
intermediary-based approaches are offered, respectively, in Lewis,
Menzies, Eells, and Kvart, "Cause and Some Positive Causal Impact," in
J. Tomberlin, ed., Philosophical Perspectives 11: Mind, Causation, and
World (Cambridge: Blackwell, 1997), pp. 401-32.

Examples of precision-based approaches include Lewis's fragility
proposal on which probability raising is assessed with C and E required
to have their exact actual times and manners, L. A. Paul's time-tracking
emendation on which C's being a probability raiser of E is disjoined
with C's being a probability raiser of E-at-t, and Deborah Rosen's
suggestion that probability raisings that disappear under
precisification are spurious. These precision-based approaches are
offered, respectively, in Lewis; Paul, "Keeping Track of the Time:
Emending the Counterfactual Analysis of Causation," Analysis, LVIII
(1999): 191-98; and Rosen, "Discussion: In Defense of a Probabilistic
Theory of Causality," Philosophy of Science, XLV (1978): 604-13.

17 The probabilistic impact buttresses the causal claim, since the
chance of the corporal deciding to charge is keyed into the chance of
him correctly understanding the major's mumbling (and is independent of
the chance of him grasping the sergeant's order).
without intermediaries to factor or differences to be precise about.18

(2) Probability raising without causation (fizzling, and its overlapping
variant). Fred grabs a brick and aims. Pam, oblivious to Fred, throws a
brick through the window. Perhaps she preempts Fred, or perhaps Fred
quits on his own, or perhaps he throws but wide. Suppose that at the
time Fred aimed there was some non-one chance of Pam succeeding and some
nonzero chance of Fred succeeding without Pam. (The preemption diagram
above represents just such a case, with Bob as the fizzler.)

Fizzled Fred's aiming is obviously not a cause of the window shattering.
But his aiming is a probability raiser of the shattering, given Pam's
non-one and Fred's nonzero chance of success at the time when Fred
aimed: without his aiming, the window's prospects were worse. Thus
probability raising is not sufficient for causation.19
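
The arithmetic behind the fizzling case can be made explicit. The sketch below assumes, purely for illustration, that Pam succeeds with chance 0.5 and that Fred would succeed with chance 0.4 should Pam fail; both numbers and the function name are hypothetical.

```python
def chance_of_shattering(pam_chance: float, fred_backup_chance: float) -> float:
    """Chance that the window shatters when Fred can succeed only if Pam fails."""
    return pam_chance + (1.0 - pam_chance) * fred_backup_chance


# Hypothetical chances at the time Fred aims.
PAM_CHANCE = 0.5          # non-one chance of Pam succeeding
FRED_BACKUP_CHANCE = 0.4  # nonzero chance of Fred succeeding without Pam

with_fred_aiming = chance_of_shattering(PAM_CHANCE, FRED_BACKUP_CHANCE)
# Deleting Fred's aiming removes his backup contribution entirely.
without_fred_aiming = chance_of_shattering(PAM_CHANCE, 0.0)

fred_raises = without_fred_aiming < with_fred_aiming
print(with_fred_aiming, without_fred_aiming, fred_raises)
```

Any non-one chance for Pam combined with any nonzero backup chance for Fred yields the same verdict, so the result does not depend on the particular values assumed here.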

Probability-raising theorists have only recently become aware of
fizzlings, but the same general strategies of factoring intermediaries
and/or requiring precision suggest themselves. There are, however (as
with preemption), variants of fizzling (overlapping) with no
intermediaries to factor or differences to be precise about.

Suppose that an atom of U238 and of Ra226 are placed in a box at t_0 and
that at t_1 there is an atom of Th234, an alpha particle, and (still) an
atom of Ra226 in the box. The presence of the radium at t_0 is not a
cause of the presence of an alpha particle at t_1. Since Ra226 decays
into Rn222 plus an alpha particle, it is evident from the absence of
Rn222 (and the continuing presence of the Ra226) that the alpha particle
was not produced by radium decay, but rather by the independent process
of uranium decay in which U238 decays into Th234 plus an alpha particle.
But the presence of the radium is a probability raiser for there being
an alpha particle, as follows from the lawful probability λ' of radium
decay.

Since particle emissions from radioactive sources (as standardly
understood) occur spontaneously and directly, there will be no hidden
intermediaries to factor, and if at least one of the particles is in a
superposition of location so that the two are "smeared" over a common
region, then there can be no differences at all in the alpha


18 For a full discussion of these cases, see my "Trumping Preemption,"
this JOURNAL, XCVII, 4 (April 2000): 165-81.

19 For more on these cases, see Menzies, and Christopher Read Hitchcock,
"Do All and Only Causes Raise the Probabilities of Effects?"
(unpublished).
[DIAGRAM 2: neuron diagram of the overlapping case, with nodes for U238,
Th234, Alpha, Ra226, and Rn222. The connection strengths are not
recoverable from the source.]


emission projected from either source. So no manner of factoring
intermediaries and/or requiring precision will help. Thus probability
raising, even when refined to factor intermediaries and/or require
precision, is not sufficient for causation.20

So far, it seems as if the causal relation tracks the process-linkage
relation rather than the probability-raising relation. The preempting
cause is connected to the effect and the fizzled noncause is not (just
look at the diagrams!). But there are also cases where the causal
relation looks to track the probability-raising relation rather than the
process-linkage relation.

(3) Causation without process linkage (disconnections). The plane is on
a collision course with the mountain. The air traffic controller is
about to alert the pilot to turn, when the saboteur destroys the radio
tower, and so the alert is not transmitted, and so the plane crashes.


[DIAGRAM 3: neuron diagram of the disconnection case, with nodes for
Sabotage, Alert, Flight, and Crash.]


The sabotaging is obviously a cause of the crash, which intuition may be
buttressed by the following considerations. Statistically, sabotages in
these circumstances will be universally followed by crashes.
Counterfactually, had the tower not been sabotaged then the crash would
not have occurred. The explanation for why the plane crashed will surely
include the sabotage. Knowing that the sabotage occurred


20 For further discussion, see my "Overlappings: Probability-Raising
without Causation," Australasian Journal of Philosophy, LXXVIII (2000):
40-46.
will license an inference to the crash. Sabotages in these circumstances
are an effective means for crash-hungry agents. The saboteur will be
held morally responsible for the crash. In short, all the connotations
of causation are in full force.

But as the diagram suggests, no process links sabotage to crash. No
relevant energy-line/structure/trope or whatnot persists between them,
and no lawful sequence links them. Rather, what is salient here is the
absence of a process. If only there had been an alert-transmitting
process connecting tower to pilot, the crash might well have been
prevented, but the sabotage prevented that.21

Process-linkage theorists appear to be largely unaware of these cases,
or at least unaware of how ubiquitous and paradigmatically causal they
are. Some process-linkage theorists have (for independent reasons)
refined their semantics by covering absences and/or conjoining
probability raising: absence-covering approaches add counterfactuals
about processes to cover the case of cause and/or effect being an
absence (~C/~E), and probability-raising conjunctions require process
linkage and probability raising.22 But covering absences will not help
since the disconnector and effect are both presences, and conjoining
probability raising (or anything else) will not help since we already
have causation without process linkage. Thus process linkage, even when
refined to cover absences and/or to require probability raising, is not
necessary for causation.

(4) Process linkage without causation (traces): Pam throws the brick
through the window while innocent Tom watches in dismay, or eats
a sandwich, or contemplates fate. Obviously, there is no causation
from Tom's watching or whatever else to the window shattering: if
ever there is a lack of causation, it is here. But there are presumably
lots of processes connecting Tom to the shattering. Photons and
sound waves and dust motes and other mere traces of Tom's watching
connect these events.


21 Disconnections are present in even the most paradigmatic causal sequences.
Gus gets mad (C), pulls the trigger (D1), fires a bullet through Vic's heart (D2), and
Vic dies (E). Here is causation if anything is. But if one examines how muscles
contract, how guns fire, and how heart injuries kill, one will find disconnections at
every step. For instance, heart injuries kill not by a connecting process, but rather
by disconnecting what would otherwise have been a process, namely, the pumping
of oxygenated blood to the brain. For further discussion, see Ned Hall, "Two
Concepts of Causation" (unpublished); Lewis, "Void and Object" (preprinted by
the Department of Philosophy, University of Melbourne, 1998); and my "Causation
by Disconnection" (forthcoming in Philosophy of Science).

22 Fair and Dowe both offer absence-covering extensions (although Dowe relegates
the extension to the ersatz status of "causation*"), and Ehring as well as
Hector-Neri Castañeda both conjoin the process-linkage and probability-raising
approaches. These refinements appear, respectively, in Fair; Dowe, Physical
Causation (unpublished); Ehring; and Castañeda, "Causes, Causity, and Energy," in Peter
French, Thomas Uehling, Jr., and Howard Wettstein, eds., Midwest Studies in
Philosophy, Volume IX: Causation and Causal Theories (Minneapolis: Minnesota UP, 1984),
pp. 17-28.

The obvious thing to say about traces is that they are irrelevant
connections. But relevance constraints (which counterfactuals express
so naturally) are just what the process-linkage semantics
forgoes.23

Covering absences clearly will not help since C and E are presences
here. Conjoining probability raising will help (assuming the
probability-raising relation in use does not require complete precision), but
at the cost of newfound failure in preemption cases since, as shown
above, preemptions involve causation without probability raising.
And conjoining probability raising will not even secure sufficiency if
the trace comes from a fizzler: fizzled Fred's aiming is both a
probability raiser of the window shattering and (presumably) connected to
the shattering by various stray photons, but is still not a cause. Thus
process linkage, even when refined to cover absences and/or to
require probability raising, is not sufficient for causation.

In disconnections and traces it seems as if the causal relation tracks
the probability-raising relation rather than the process-linkage
relation. The disconnector is a probability raiser while the trace noncause
is not (the sabotage makes the crash more likely while the stray
photon does not make the window shattering more likely).

The four problem cases rendered above interlock to form a single
problem space that no extant account of causation can navigate.


Problem Space: the preempting cause (1) and trace noncause (4) are both
process linked to, but not probability raising of, the effect in question.
The disconnecting cause (3) and fizzled noncause (2) are both
probability raising of, but not process linked to, the effect in question.
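
The interlocking structure of the problem space can be checked mechanically. The following Python sketch is an illustration only, not part of the original argument: the names CASES, all_rules, and fits are hypothetical, and the feature assignments simply transcribe the Problem Space statement above. It tries every truth function of the two features (process linked, probability raising) and counts how many reproduce the intuitive verdicts on all four cases.

```python
from itertools import product

# Each case: ((process_linked, probability_raising), intuitively_a_cause).
# The assignments transcribe the Problem Space statement; labels are illustrative.
CASES = {
    "preemptor (1)": ((True, False), True),
    "fizzler (2)": ((False, True), False),
    "disconnector (3)": ((False, True), True),
    "trace (4)": ((True, False), False),
}

# The four possible feature combinations.
INPUTS = list(product([False, True], repeat=2))


def all_rules():
    """Yield every truth function of the two features, as a dict."""
    for outputs in product([False, True], repeat=len(INPUTS)):
        yield dict(zip(INPUTS, outputs))


def fits(rule):
    """True if the rule gives the intuitive verdict on all four cases."""
    return all(rule[features] == verdict for features, verdict in CASES.values())


SUCCESSFUL_RULES = [rule for rule in all_rules() if fits(rule)]
print(len(SUCCESSFUL_RULES))  # prints 0: none of the 16 rules fits
```

No rule succeeds because the preemptor and the trace share the same pair of features yet differ in causal status, as do the disconnector and the fizzler. Any account that looks only at these two relations to the effect, however combined, must therefore misclassify at least one case.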


An adequate account of causation must explain the causally relevant
difference between preemptors and traces, and between disconnectors
and fizzlers (vandals and saboteurs are causes, while failed vandals
and bystanders are not). Neither the probability-raising nor the
process-linkage relation, however interpreted and however refined,
can reveal the nature of causation on their own.


23 Salmon, whose theory of explanation was subjected to a related critique by
Hitchcock (in "Discussion: Salmon on Explanatory Relevance," Philosophy of Science,
LXII (1995): 304-20), has conceded this point for explanation: "As a result of
Hitchcock's analysis, I would now say that connecting causal processes, in the
absence of statistical relevance relations, also lack explanatory import" ("Causality
and Explanation: A Reply to Two Critiques," Philosophy of Science, LXIV (1997):
461-77, here p. 476). I would add that, given how entwined causation and
explanation are (especially for Salmon), the same should hold for causation.


III. PROBABILITY RAISERS OF PROCESSES
Causation, I here propose, is rooted in the comparative probability of
the connecting line to the effect with the cause versus without. This
approach synthesizes the probability-raising and process-linkage views
and successfully navigates the problem space mapped above.
For starters, consider:


Analysis 1: C causes E if and only if C is a (P)robability (R)aiser (O)f a
(P)rocess for E (for short: C is a PROP for E).


The idea is that, to assess if C causes E, first identify the actual
process(es) linked to E, and second assess the probabilistic relevance
of C, not to E, but to the comparatively extended event that comprises
the E-process (or one of them in overdeterminations with multiple
processes). The E-process, for a given candidate cause C, is to be
traced back to the time of C, and may or may not include C. In worlds
in which the causal order is the temporal order, the E-process runs
the closed interval [t_C, t_E], and in most actual world cases the
E-process may be thought of as the energy lines in [t_C, t_E] converging on E
(an E-process is thus not the "causal intermediaries" between C and
E).

Given the counterfactual chance understanding of probability raising
and the nomic subsumption understanding of process linkage
sketched above, Analysis 1 may be further interpreted as:


Analysis 1 Interpreted: C is a PROP for E if and only if (i) there is an
extended event 'E-line' containing actual distinct events (C', D_1, D_2, ...,
D_n, E) in pairwise nomic subsumption relations, (ii) there is an actual
event C at t_C' which is distinct from D_1, D_2, ..., D_n, and E (C may or may
not be distinct from C'), (iii) ch(E-line)-at-t_C = p, and (iv) ~C □→
ch(E-line)-at-t_C < p.


This interpretation of Analysis 1 is compatible with a purely "Humean"
metaphysic of just occurrent properties and space-time, given
a prior Humean account of laws, chances, and counterfactuals. Of
course, other interpretations are available: even the causal primitivist
may accept Analysis 1 as revealing how the notions of probability and
process derive from primitive causation.

Analysis 1 synthesizes the probability-raising and process-linkage
views. Analysis 1 may be thought of as a development of the precision
strategy for the probability-raising view. Lewis considers precision
with respect to time, precision with respect to manner, and precision
with respect to causal origins. He rejects precision with respect to
time and/or manner as oversensitive to spurious influences, and
rejects precision with respect to causal origins as trivializing the
account of causation. But there is a fourth possibility (which is
neither oversensitive nor trivializing): precision with respect to lawful
origins. If we think of E as having its lawful origins essentially, then E
exists if and only if E-line exists. It would then follow that C is a
probability raiser of E-line if and only if C is a probability raiser of
E-with-its-actual-lawful-origins.24

Analysis 1 may also be thought of as an extension of the
absence-covering strategy for the process-linkage view. Fair, who saw that
absences can be causal, extended his view by considering
counterfactuals about processes:


(1) C causes E if and only if (a) C and E occur, and (b) C is physically
connected to E.

(2) C causes ~E if and only if (a) C occurs and E does not, and (b) C is
physically connected to an E' incompatible with E.

(3) ~C causes E if and only if (a) C does not occur and E does, and (b)
had C occurred it would have been physically connected to an E'
incompatible with E.

(4) ~C causes ~E if and only if (a) neither C nor E occurs, and (b) had
C occurred it would have been physically connected to E.


In the case where E is an absence, think of the E' incompatible with
E as the manifestation of E's absence: if E is the absence of beer in the
fridge, its manifestation is the fridge actually being stuffed with milk,
grapes, and air. If we think of the (Property, Region) pair of absent Es
as (Manifestation, Region), then Analysis 1 can be seen as both a
unified generalization over, and indeterministic extension of, Fair's
four cases, all of which can now be seen to relate how the
E-manifestation process depends on whether or not C occurs.

Thus Analysis 1 is a true synthesis of the probability-raising and
process-linkage approaches, in that it not only explicitly incorporates
both notions, but also extends the leading refinements of these
approaches to convergence.

Analysis 1 also explains the success of counterfactual accounts of
causation. In the deterministic case, if C and E both occur and E
counterfactually depends on C, then it follows that whatever process
actually produced E must depend on C (else E would still occur
absent C via this independent process), so C is a PROP for E. In the
other direction, if C is a PROP for E and there is no other E-process
(no redundant causation) then E will counterfactually depend on C.
Thus Analysis 1 explains why, at least in the deterministic case, the
counterfactual account of causation proves such an excellent account,
except for the seemingly intractable difficulties created by
redundant causation (overdetermination and preemption).


24 Analysis 1 is still preferable to lawful-origin precision, since the latter makes
dubious claims about event essences (though perhaps the essentiality of lawful
origins of events is defensible on analogy with the essentiality of parental origins of
persons), and obscures just how much of the work of the theory consists in spelling
out an appropriate account of lawful origins.

Analysis 1, moreover, yields a unified and informative solution to
the problem space rendered above. Consider a given candidate cause
C. If the E-process includes C, then C is a PROP for E if and only if C
is an essential part of the E-process: without C, if E still occurs it is via
a different process entirely, rather than the same process slightly
altered. If the E-process does not include C, then C is a PROP for E if
and only if C is a shield for the process: without C, the chance of that
same process running to completion would have been less. Analysis 1
is satisfied by essential parts of, as well as shields for, processes. With
this the problem cases above all fall into line.
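
The two routes to satisfying Analysis 1 can be put in a small Python sketch. Everything in it is an assumption made for illustration: the Candidate type, the prop_role function, and above all the chance values, which are invented stand-ins and are not drawn from the text. The only point is the comparison itself: C is a PROP for E when the chance of the actual E-process is lower without C than with it, and C then counts as an essential part if it lies on the process and as a shield if it does not.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Candidate:
    name: str
    in_process: bool       # is C itself part of the actual E-process?
    chance_with: float     # chance of the actual E-process, given C
    chance_without: float  # chance of that same process, had C not occurred


def prop_role(c: Candidate) -> Optional[str]:
    """Return the way C satisfies Analysis 1, or None if C is not a PROP."""
    if c.chance_without >= c.chance_with:
        return None  # C does not raise the chance of the E-process
    return "essential part" if c.in_process else "shield"


# Illustrative numbers only (assumed, not from the text).
CANDIDATES = [
    Candidate("preemptor", True, 0.9, 0.0),      # Pam's throw
    Candidate("trace", True, 0.9, 0.9),          # Tom's stray photons
    Candidate("disconnector", False, 0.9, 0.1),  # the sabotage
    Candidate("fizzler", False, 0.9, 0.9),       # Fred's misfire
]

ROLES = {c.name: prop_role(c) for c in CANDIDATES}
for name, role in ROLES.items():
    print(name, "->", role or "not a PROP")
```

The sketch leaves untouched the hard question of when a process without C is the same process slightly altered rather than a different process entirely; that judgment is built into the chance_without values supplied by hand.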

In preemptions, the preemptor comes out a cause because it is an
essential part of the E-process: without the preemptor, even though E
still occurs it is by a different process entirely, namely, by the backup
process. In the case of Pam and Bob, the window-shattering process
(assessed from the time of Pam's throw) is the sequence of events of
her throwing the brick, her brick flying ever closer to the window,
hitting it, and producing waves of force that shatter it. Pam's throw is
a part of this process and an essential part, since without it if the
window gets shattered at all it will be by a different process entirely:
Bob's process. This solution extends to trumpings, given a suitably
general conception of process to appreciate that the trumper (major)
is on the process to the effect and the backup (sergeant) not.25

Traces, however, come out noncauses because they are inessential
parts of the E-process: without the trace, E still occurs by the same
process slightly altered. Thus, without the traces of Tom's watching,
the process by which Pam throws her brick through the window
would be in essence the same, minus a few irrelevant stray photons.26


25 Applying the lawful sequence analysis of processes: there are trumping laws
linking ranking orders to decisions. Since the major's order is the ranking order,
only the major's order, not the sergeant's, instantiates the antecedent. The
corporal's decision instantiates the consequent.

26 We get along perfectly well without a defined counterpart relation for events,
persons, tables, and the like, so in this respect processes are on par with the other
furniture of the world. Our robust intuitions serve us sufficiently well.
There is a continuum of cases between traces and genuine causes
(just imagine the dust mote from Tom swelling until it is as impactful
as Pam's brick), and obviously we cannot determine at which point
noncause turns into cause. Likewise, there is a continuum of cases
between the same process slightly altered and a different process
entirely. So our understanding of the causally relevant difference
between preempting causes and trace noncauses is vague along
exactly the same continuum as our understanding of the counterpart
relation for processes: analysans swings with analysandum. Thus
Analysis 1 not only explains the causally relevant difference between
preemptions and traces (essential versus inessential process parts),
but does so in the right way to mirror the logical structure involved.

In disconnections, the disconnector comes out a cause because it
shields the E-process from what otherwise might have interrupted it:
without the disconnector, the chance of the E-process running to
completion would have been less. In the sabotage case, the crashing
process consists in the main of the plane flying mountainward
through the relevant interval and then crashing. Without the
sabotage, the chance of this process would have been less, because the
alert would then have been transmitted and the pilot would then have
turned the plane away. Thus the sabotage shields the crashing
process from the alert.

Fizzlers, in contrast, come out noncauses because they are neither
essential parts of, nor shields for, the E-process. Thus Fred's misfiring
is neither a part of nor a shield for the process by which Pam throws
her brick through the window. This solution extends to overlappings,
given a suitably general conception of process to appreciate that the
cause (U238) is connected to the effect and the noncause (Ra226)
not.27


IV. REFINEMENTS
Analysis 1, which identifies causes with probability raisers of
processes, while an improvement over its predecessors with respect to the
space of problems discussed above, is subject to (at least) three
objections. First, the account fails to provide for the transitivity of
causation. There are cases (example to be given shortly) in which C
is a PROP for D and D is a PROP for E, but C is not a PROP for E.
Second, the account fails to provide for the continuity of causation. It
seems that, at least in the actual world (leaving the mysterious EPR
correlations of quantum mechanics out of the discussion), all
causation involves spatiotemporally continuous chains of causal
intermediaries. But there are actual-world-ish cases (example to be given
shortly) in which C is a PROP for E but there are gaps in the line of
PROPs from C to E. Third, the account is subject to counterexample
from preemptive disconnections.


27 Applying the lawful sequence analysis: since the decay laws are U238 → (Th234
& alpha) and Ra226 → (Rn222 & alpha), the absence of Rn222 shows that the
consequent of the radium decay law is not instantiated. Thus the process in
question is not radium decay. Rather, as the presence of Th234 shows, the process
in question is uranium decay.

All three objections can be brought out in the following case, which
feeds a preemption case into a disconnection (preemptive
disconnection). Pam and Bob are out to sabotage the radio tower and prevent
an alert from reaching the pilot. Pam places her puny bomb (.1
chance of blowing up the tower), while Bob is too overcome with
laughter to proceed (he would have almost certainly blown up the
tower had Pam not distracted him). Pam's bomb then actually
succeeds in destroying the tower (preemption), and so the alert is
prevented, and so the plane crashes (disconnection).


DIAGRAM 4 (figure not reproduced; node labels: Pam, Bob, Sabotage, Alert, Flight, Crash)


Pam's saboteuring is clearly a cause of the crash and Bob's is not,
but Analysis 1 unfortunately gets this backward. Pam's saboteuring is
a probability lowerer of the crash process (which consists, in the
main, of the plane flying mountainward through the relevant
interval): since Bob is a more reliable saboteur, the plane's chances would
have been worse with Bob in action. And Bob's saboteuring is a
probability raiser of the crash process, given that there was some
chance he might have managed not to get distracted by Pam. In
short, Pam is a preempting shield and Bob a fizzled shield for the
crash.

It is worth noting that preemptive disconnections are everyone's
problem (tu quoque). Probability-raising accounts get them
backward generally, since Pam is a preemptor and Bob a fizzler.
Process-linkage accounts get it right that Bob is a noncause, but only
because they cannot count Pam as a cause at all since she is a
disconnector.
Preemptive disconnections also illustrate the intransitivity and
discontinuity of Analysis 1. Intransitivity: Pam's saboteuring is a PROP of
the destruction of the tower, which is a PROP of the crash, but as per
above her saboteuring is not a PROP of the crash. Discontinuity:
Bob's saboteuring is a PROP of the crash, but there is a discontinuity
where he fizzles.

I see two options. First, one might bite the bullet on all three
objections. Transitivity has been questioned before,28 continuity is
equally questionable, and perhaps one could learn to live with a
counterintuitive verdict in preemptive disconnections, especially as
long as no alternative analysis of causation fares better.

A second option, which I prefer, is to refine the analysis. It turns
out that all three objections can be handled, and the original
solutions to preemptions, traces, disconnections, and fizzlings
maintained, by adding a continuity requirement.

So I propose:


Analysis 2: C causes E if and only if C is a continuous PROP for E.


The continuity in question must be with respect to something, and
the obvious candidate is space-time. Thus (at least in worlds for which
the causal order is the temporal order), I further interpret Analysis 2
as:


Analysis 2 Interpreted: C is a continuous PROP for E if and only if there is
a chain of direct PROPs between C and E, where C is a direct PROP for
E if and only if (a) C is a PROP for E, and (b) for all times t_D between t_C
and t_E, there is a D at t_D such that C is a PROP for D, and D is a PROP
for E.
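
The definition can be exercised on the preemptive disconnection case with a short Python sketch. Several things here are assumptions made for illustration and are not in the text: time is discretized into integer steps, the event names and their times are hypothetical, and the PROP set simply stipulates the Analysis 1 verdicts described above (in particular, that Bob's saboteuring is a PROP for the crash while Pam's is not). A direct PROP is then checked clause by clause, and a continuous PROP is found by searching for a chain of direct PROPs.

```python
# Hypothetical discrete timeline for the preemptive disconnection case.
TIME = {
    "pam_bomb": 0, "bob_plan": 0,
    "bomb_ticking": 1, "bob_laughs": 1,
    "tower_destroyed": 2,
    "alert_absent": 3,
    "crash": 4,
}

# Stipulated Analysis 1 verdicts (assumed for illustration): (C, E) pairs
# where C is a PROP for E. Note that ("pam_bomb", "crash") is absent.
PROP = {
    ("pam_bomb", "bomb_ticking"), ("pam_bomb", "tower_destroyed"),
    ("bomb_ticking", "tower_destroyed"),
    ("tower_destroyed", "alert_absent"), ("tower_destroyed", "crash"),
    ("alert_absent", "crash"),
    ("bob_plan", "bob_laughs"), ("bob_plan", "crash"),
}


def is_prop(c, e):
    return (c, e) in PROP


def is_direct_prop(c, e):
    """Clause (a): C is a PROP for E. Clause (b): at every time strictly
    between them there is some D with C a PROP for D and D a PROP for E."""
    if not is_prop(c, e):
        return False
    for t in range(TIME[c] + 1, TIME[e]):
        if not any(TIME[d] == t and is_prop(c, d) and is_prop(d, e)
                   for d in TIME):
            return False
    return True


def is_continuous_prop(c, e):
    """True if some chain of direct PROPs leads from C to E."""
    frontier, seen = [c], {c}
    while frontier:
        x = frontier.pop()
        for y in TIME:
            if y not in seen and is_direct_prop(x, y):
                if y == e:
                    return True
                seen.add(y)
                frontier.append(y)
    return False


print(is_continuous_prop("pam_bomb", "crash"))  # True
print(is_continuous_prop("bob_plan", "crash"))  # False
```

On these stipulations Pam reaches the crash through the destruction of the tower, each link being a direct PROP, while Bob is blocked at the time of his laughter: the only event at that time for which his saboteuring is a PROP is not itself a PROP for the crash.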


Analysis 2 answers the transitivity and continuity objections, while
handling preemptive disconnections. Transitivity is gained, because if
C causes D and D causes E then it follows that there is a chain of direct
PROPs between C and D, and between D and E, and therefore a chain
of direct PROPs between C and E. Continuity is gained explicitly. And
in the preemptive disconnection case above, Pam's saboteuring will
count as a direct PROP of the blowing up of the tower, and the
blowing up of the tower will count as a direct PROP of the crash, and
so Pam will come out a cause. Bob, on the other hand, cannot be
connected to the crash by direct PROPs because there is a
discontinuity involved at the point where he is overcome with laughter and
fizzles out: there is no event D to be found at that time such that both
Bob's saboteuring is a PROP for D and D is a PROP for the crash
(Bob's laughter fails the latter).29


28 Hartry Field proposes the following counterexample to transitivity: your enemy
places the bomb by your door (C), which causes your friend to defuse the bomb
(D), which causes you to survive to eat breakfast next morning (E). But surely your
enemy placing the bomb by your door (C) does not cause you to survive to eat
breakfast (E)! As the reader may confirm in this example, C is a PROP for D (as an
essential part of the D-process), D is a PROP for E (as a shield of the E-process), but
C is not a PROP for E. For further proposed counterexamples to transitivity, see also
Michael McDermott, "Redundant Causation," British Journal for the Philosophy of
Science, XLVI (1995): 523-44; Hall (op. cit.) and "Causation and the Price of
Transitivity," this JOURNAL, XCVII, 4 (April 2000): 198-222.

As the reader may confirm, Analysis 2 preserves the solutions to
preemptions, fizzlings, disconnections, and traces. The preemptor
will count as itself a direct PROP (no chaining needed) of the effect,
with the continuous path of the preempting process providing the
relevant continuously present D events. The fizzler will not count as
a direct PROP of the effect and no chaining of direct PROPs will
rebuild a connection, since the fizzle point will always constitute a
discontinuity. The disconnector will count as a direct PROP, with the
continuous D events being the sequence of events of the destruction
of the tower, followed by the sequence of absences of the alert
transmission to the pilot, followed by the sequence of the pilot
maintaining course. And the trace will not count as a direct PROP
and no chaining of direct PROPs will rebuild a connection, since
though there is a continuous PROP sequence of the trace arriving at
the effect, the irrelevance of any part of that sequence to the end
always remains.

But there is a price to be paid: it looks as if a continuity
requirement must stipulate away the seeming possibility of spatiotemporally
discontinuous causations.30 If Merlin casts a "prince to frog" spell at
noon and the prince turns into a frog at midnight, and there is a law
directly connecting magic spells at noon to enchantments at
midnight, then it seems that Merlin's spell is a discontinuous cause of the
enfrogging. Analysis 1 has no problem with such cases: the process
involved will be the two-membered sequence (spell, enchantment),
and the spell will be an essential part of such a sequence. But Analysis
2 will find no event D at, for example, 6:00 pm such that Merlin's spell
is a PROP for D, and D is a PROP for the enfrogging.


29 If we merely take the transitive closure of Analysis 1, this will fail to exclude
Bob.

30 If there is some independent way of defining the "causal dimension," then the
PROPs relation can be required to be continuous with respect to that dimension
rather than space-time. This would solve the problem, but I do not know how to
define such a dimension.
I see three options: retreat to Analysis 1, bite the bullet and declare
discontinuous causation impossible,31 or refine the analysis further to
maintain the original solutions, handle preemptive disconnections,
and allow discontinuous causation.

The causally relevant difference between Bob the fizzled shield and
Merlin the successful spellcaster seems to be that, although both are
PROPs for their respective Es, there is a causal gap in the line of
PROPs from Bob's saboteuring to the collision, and no causal gap in
the PROP relation from Merlin's spellcasting to the enchantment. If
one can determine which PROP relations need intermediaries and
which do not (obviously without relying on causal or temporal gaps)
then one can require continuity with respect to needed
intermediaries only, rather than with respect to space-time, and thereby achieve
a successful further refinement.

But I do not know how to determine when intermediaries are
needed. Some not implausible suggestions are available: perhaps one
can study whether C remains a PROP for E on the counterfactual
supposition of certain candidate needed-intermediaries not
occurring, or perhaps one can study whether there are intermediaries D
(where C is a PROP for D and D is a PROP for E) in certain
unproblematic exemplars of such a C causing such an E. I do not
know whether these suggestions will work generally. More work
remains to be done.

The nature of the causal relation remains mysterious, but I believe
that in important respects both Analysis 1 and Analysis 2 shed more
light on this relation than any extant alternatives, and that there may
be some Analysis 3 nearby that will yield complete success. I conclude
that causes are best understood as probability raisers of processes,
though exactly how best the story goes remains an open question.

JONATHAN SCHAFFER 
University of Massachusetts/Amherst 


31 Mellor (op. cit.) regards spatiotemporal continuity as a central connotation of
causation, and the other successes of Analysis 2 may be taken as an indirect
argument for Mellor's claim.




BOOK REVIEWS 


Responsibility and Control: A Theory of Moral Responsibility. JOHN MARTIN
FISCHER and MARK RAVIZZA. New York: Cambridge University Press,
1998. viii + 277 p. Cloth $64.95, paper $19.95.*


In Responsibility and Control: A Theory of Moral Responsibility, John
Martin Fischer and Mark Ravizza defend a distinctive compatibilist
account of the control (or free will) condition necessary for moral
responsibility. They maintain that this condition requires guidance
and not regulative control. Guidance control consists in the power to
perform an action freely; regulative control consists in the power to
perform an action freely, and also the power to act otherwise (31).
Their analysis of guidance control yields a reasons-responsive,
actual-sequence, mechanism-based model. Reasons responsiveness
characterizes the pertinent control in terms of the capacity to respond
appropriately to (at least some) reasons for action (41-46). An
actual-sequence account illuminates control purely in terms of the
(dispositional) properties present in the actual causal sequence of events
giving rise to freely willed action, and not even partially in terms of
the properties manifest in alternative courses of action (37-38). A
mechanism-based account does not attribute control to an agent, but
to more carefully specified features directly involved in the etiology of
an agent's action, features that, Fischer and Ravizza explain, must be
the agent's own in an important sense (38-39).

Mechanism-based accounts as opposed to agent-based accounts
give reasons-responsive, actual-sequence compatibilist models a
crucial advantage in explaining the guidance control at work in an
example of the sort made famous by Harry Frankfurt. These
controversial examples attempt to show, typically in defense of
compatibilism, cases in which a paradigmatically morally responsible agent
acts freely but in which circumstances playing no role in an agent's freely
willed action nevertheless ensure that the agent cannot do other than
she does. Notice that, in such cases, an agent is not reasons responsive
(38). Due to the ensuring conditions, were the agent presented with
reason to act otherwise, the agent would not be able to do so. But
focus instead upon the agent's mechanism of action in the actual
sequence of events. If the mechanism were operative, and if different
reasons were presented to the agent, then the agent, by way of this
mechanism, would do otherwise (38-39). (Of course, in worlds in
which such a mechanism is operative and the agent has reason to do
otherwise, the restrictive ensuring conditions are not in place.)
Hence, a mechanism-based approach permits Fischer and Ravizza a
reasons-responsive, actual-sequence analysis of control even in
Frankfurt cases. This is an enormous theoretical advantage.


* I would like to thank Bernard Berofsky, Carl Ginet, Ishtiyaque Haji, Al Mele,
Derk Pereboom, Eleonore Stump, and David Widerker for excellent comments on
earlier drafts of this review. I owe a special debt of gratitude to John Martin Fischer
for his thorough comments and generous support.


I 

There is a great deal at stake in Fischer and Ravizza's work. Theirs is
the most advanced, thorough, and provocative treatment of
compatibilism to date. Several features of their account make this so. Here
are four.

First, by offering a convincing account of guidance control (an
account that does not require the ability to do otherwise), Fischer
and Ravizza are able to avoid the most powerful case for
incompatibilism: that determinism rules out the freedom to do otherwise, and
the inability to do otherwise rules out moral responsibility (33-34).

Second, Fischer and Ravizza's account elegantly advances an
insight about reasons responsiveness central to Susan Wolf's highly
influential work in, among other places, Freedom within Reason.1
Wolf's view, while enticing, leaves the notion of reasons
responsiveness unanalyzed. It is also open to incompatibilist worries about
blameworthiness. Why? For Wolf, praiseworthiness and
blameworthiness are asymmetrical in their control conditions (ibid., pp. 79-81).
While praiseworthiness requires only guidance control,
blameworthiness requires regulative control. But this reliance on regulative
control invites classic incompatibilist worries about the ability to do
otherwise. Fischer and Ravizza's account remedies both of these
shortcomings by carefully articulating the conditions for reasons
responsiveness (62-91), and also by showing that the intuitions driving Wolf
to treat moral responsibility for the praiseworthy as resting upon
guidance control (and not alternative possibilities or regulative
control) can just as easily be brought to bear upon the blameworthy
(55-61).

Third, Fischer and Ravizza's compatibilism can be understood as a
defense of the compatibilist thesis advanced by Daniel Dennett in
Elbow Room: Varieties of Free Will Worth Wanting.2 Dennett, too,
maintained that the freedom associated with alternative (and not actual)
causal sequences is otiose (ibid., p. 132). Fischer and Ravizza,
however, take far more seriously than Dennett was willing to,
incompatibilist arguments for the incompatibility of determinism and the
freedom to do otherwise (17-25, and 33-34). Acknowledging that
such freedom would be sufficient to satisfy the control necessary for
moral responsibility, Fischer and Ravizza concede that this kind of
freedom is incompatible with determinism. Also, in Elbow Room
Dennett swiftly dismissed Frankfurt's famous compatibilist argument
based on counterexamples. Dennett stated that such examples, while
interesting, were "insufficiently ambitious" (ibid., p. 132). As such,
they could not bear the fruit of a general account of compatibilist
freedom. Fischer and Ravizza, alternatively, make use of the
Frankfurt examples as a key ingredient in their demonstration that it is only
guidance control that is necessary for moral responsibility. On this
score, while Dennett simply missed the boat, Fischer and Ravizza built
a better one!


1 New York: Oxford, 1990.
2 Cambridge: MIT, 1985.

Fourth, Fischer and Ravizza’s defense of compatibilism also makes
innovative appeal to historical considerations as a condition for
freedom and responsibility (170-206). Compatibilism has long been
threatened by manipulation cases that show that all the requisite
“compatibilist-friendly” control conditions could be satisfied by some,
intuitively, responsibility-undermining process of manipulation. If so,
then what makes determinism different from, say, a demon’s process
of manipulation? Fischer and Ravizza give an historical account of
how an agent’s reasons-responsive mechanism of action comes to be
her own (207-39). They argue that such a process cannot, in
principle, be the upshot of a process of manipulation, yet it could be
realized in a determined world. This aspect of their account also
rightly earns their work a prominent status within the literature.

For all of the above reasons, Fischer and Ravizza's Responsibility and
Control represents an outstanding advance in the free-will debate.
Despite these attributes, in what follows, by focusing upon their use of
mechanism individuation, I shall argue that their defense of an
actual-sequence model is threatened on plausible incompatibilist
grounds.


II 
In advancing their account of reasons responsiveness, Fischer and
Ravizza devote considerable attention to a proper articulation of the
degree of responsiveness required to capture the contours of morally
responsible agency. They attempt to move between two extremes,
one in which an agent acts from a reasons-responsive mechanism that
is sensitive to any sufficient reason to do otherwise, another in which
an agent acts from a reasons-responsive mechanism that is sensitive to
perhaps just one reason to act otherwise (or instead some very small
and perverse range of reasons to act otherwise) (63-68). One extreme
sets standards for morally responsible agency too high, another too
low. To advance their view, Fischer and Ravizza make use of the
notion of a “mechanism” issuing in action. They thus unpack reasons
responsiveness by first holding fixed the mechanism issuing in action in
the actual world. Then they consider possible worlds in which the
“same” mechanism is active and in which different reasons to act
otherwise are presented to the agent. In some of those worlds,
Fischer and Ravizza maintain, the agent does act otherwise. This
range of worlds, and of reasons, according to Fischer and Ravizza,
must demonstrate a rational pattern (69-85). Their work here is
subtle, and the account of “moderate reasons responsiveness” which
emerges is compelling. It hangs, however, on their ability to make use
of the notion of “same mechanism.”

Fischer and Ravizza openly admit that they have no principled
basis for individuating mechanisms. They simply settle for natural or
intuitive judgments in various cases (47). As with sameness of smile
or sameness of house, Fischer and Ravizza explain, sameness of
mechanism does not require sameness right down to microdetails
(52). Just as we can recognize a good friend’s “same old smile” over
the years, despite an aging mug, so too can we recognize “same
mechanism issuing in action” despite variations in things like the
reasons present to the agent in worlds in which the same mechanism
operates.3

Fischer and Ravizza state that by ‘mechanism’ they do not mean to
point to “anything over and above the process that leads to the
relevant upshot” (38). But, construed most naturally, the process that
leads to an agent's action includes the entire complex of proximal
antecedent events and states figuring into a correct explanation of
why the action occurred.4 How should the salient features from this
complex be isolated as those figuring in the particular mechanism
issuing in action? If, despite Fischer and Ravizza's denial, sameness


3 The only restriction Fischer and Ravizza place on the mechanisms at issue is
that they must be temporally intrinsic. Thus, the temporally extrinsic mechanism,
deliberation preceding the donation of five dollars to a charity, entails that an agent give five
dollars to a charity in any world in which that mechanism operates. As such, this
mechanism is not the kind at issue with regard to guidance control (46-47). The
temporally intrinsic mechanism, deliberation involving a donation to a charity, does not
entail that an agent donate to a charity at a later time.

4 I thank Carl Ginet for this insight.
down to the microdetails were required for sameness of mechanism,
their account of reasons responsiveness would face serious troubles.
Suppose that the blameworthy reason (desire, intention, or belief)
upon which an agent acts in the actual world is identical with (or
supervenes upon) some neurophysiological state of the brain. If
‘same mechanism’ requires sameness of microdetails, then, when
different sufficient reasons to do otherwise are present in another
possible world, the operative mechanism in the possible world would
have to respond to reasons which shared the same neurophysiological
details as those associated with the reasons present to the agent in the
actual world. But this would be nomically implausible (albeit not
metaphysically impossible). If, as seems likely, the different reasons
presented to the agent in the other possible world did not share the
same microdetails, then in that possible world, holding fixed the
mechanism operative in the actual world, the agent might not respond
appropriately to the sufficient reason to do otherwise. Minimally, these
reflections indicate that Fischer and Ravizza's thesis would be
susceptible to refutation based upon likely empirical data.

It is therefore crucial to Fischer and Ravizza’s approach that they
exclude a narrow construal of sameness of mechanism. Yet because
they offer no principled basis for mechanism individuation, they must
rest their thesis purely on intuitive reactions to different cases. But, it
might be objected, which elements from the entire complex (of
proximal events and states antecedent to an action) should figure
intuitively into the relevant mechanism will vary relative to
explanatory perspective. The neurophysiologist’s basis of parsimony will be
different from that employed in everyday folk-psychological
discourse. What reason have we to assume that Fischer and Ravizza's
basis for individuation is the correct one?

The situation worsens if one pushes for a hyper-restricted notion of
sameness of mechanism. On the hyper-restricted construal, the
entire complex of proximal antecedent events and states functions as the
pertinent mechanism. If this were the relevant mechanism, an agent
could not act from a reasons-responsive mechanism at a deterministic
world. Any possible world in which the “same” mechanism operates
at some time must share all the same facts with the actual world at that
time. But if determinism is true in the actual world, there is no
possible world which shares the same facts with the actual world at a
time and in which, at a subsequent time, an agent acts differently in
the possible world than she does in the actual world. On the
hyper-restricted construal, only the incompatibilist could make sense of a
reasons-responsive mechanism. Only at an indeterministic world
could it be the case that the very mechanism operating in the actual
world operates in a possible world and, on the basis of different
reasons, the agent acts differently at that possible world.5

Some incompatibilists might be inclined to draw their line in the
sand here, insisting upon a hyper-restricted notion of mechanism
individuation. But, anticipating this move, Fischer and Ravizza reply
that this would be question begging (52, note 22). The source of the
incompatibilist’s insistence would arise from the very thought that the
appropriate responsiveness requires that the agent be able to act
differently in the same world as the actual world in which she acts.
This is precisely what is at stake. On this point, Fischer and Ravizza
seem right. Yet it also seems problematic for Fischer and Ravizza to
lean exclusively on intuitions about different cases in order to work
with a looser notion of sameness of mechanism. Without any
principled way to individuate mechanisms they are vulnerable to a similar
charge: that their intuitions regarding sameness simply mask their
compatibilist convictions that responsiveness does not require that an
agent be able to act differently in the actual world.

There are other places where Fischer and Ravizza lean too heavily
upon their intuitive judgments regarding sameness of mechanism.
For instance, they distinguish two components of reasons
responsiveness. Reasons receptivity allows agents to recognize and evaluate
reasons for action (69-73). Here, Fischer and Ravizza place considerable
constraints on the range of reasons to which a morally responsible
agent’s mechanism of action must be receptive. Reasons reactivity
allows agents to act on the basis of the reasons for action (recognized
at the receptivity stage) (73-76). Fischer and Ravizza permit a morally
responsible agent’s mechanism of action a quite limited and
“idiosyncratic” pattern of reactivity (74-75). They hold that a morally
responsible agent who is blameworthy for her action must act from a
mechanism that would at least react to some reason to act otherwise,
were some other reason to do so present in another possible world.

Given Fischer and Ravizza's account of reasons reactivity, here is a
reasonable (even predictable) incompatibilist objection: reacting to a
different sufficient reason to do otherwise in some other possible
world will not show why an agent should be held blameworthy for not
reacting to a morally sufficient reason to do otherwise in the actual
world. Is not the relevant question, ‘Could the agent have reacted
differently to the very reason that was presented to her in the actual
world?’ Fischer and Ravizza respond by claiming that “reactivity is all


5 At an indeterministic world, it need not be determined which of competing
reasons the agent decides to act on, nor, for that matter, what reasons bear upon an
agent’s deliberations.
of a piece.” Reacting to a different sufficient reason to do otherwise
in another possible world proves a general capacity to react to any
sufficient reasons to do otherwise. Hence they conclude that the
agent's mechanism retains this general capacity in the actual world.
There is a great deal at stake in this move. Without it, to show that a
blameworthy agent acts from a reasons-reactive mechanism, one
would have to show that there are possible worlds in which the agent
reacts differently to the same moral reasons as those present in the
actual world. But, as explained above, this would involve
incompatibilist-friendly regulative control.

When defending their claim that reactivity is all of a piece, the
authors ask how it might be that reactivity is not all of a piece. That is,
how might it be that an agent is able to react to a sufficient reason to
do otherwise in one possible world and yet does not retain the
general capacity to react to different reasons in other possible worlds
(in which the same mechanism operates)? They offer the following
speculation:


How might this have turned out not to be true? Imagine that the agent
somehow gets considerably more energy or focus if he is presented with
a strong reason to do otherwise, and it is only in virtue of these factors
that he successfully reacts to the reason (74).


Here is how Fischer and Ravizza respond:


There certainly can be cases like this, but it is natural to say that, when
the agent acquires significantly more “energy or focus,” this gives rise to
a different mechanism from the actual mechanism. Our point is that,
holding fixed the actual kind of mechanism, reactivity is all of a piece: if the
mechanism can react to any reason to do otherwise, it can react to all
such reasons (74).


Fischer and Ravizza cast the thought experiment in terms of the
agent’s getting more energy or focus when presented with strong
reasons. Of course, the question is, can the agent's mechanism gain
more energy or focus when presented with stronger reasons? If it
could, then it would not be the case that reactivity is “all of a piece.”
If one grants the intelligibility of talk of ‘mechanisms’ as opposed to
agents in the first place, then it is hard to understand why it is any less
natural to explain the thought experiment by saying that the very
mechanism issuing in one action might get a different amount of
charge or focus in the presence of some reasons than in the presence
of others. Because so much rides on their claim that reactivity is all of
a piece, and because (granting talk of mechanisms) there are
competitors as to what seems a natural way to explain the thought
experiment, Fischer and Ravizza’s appeal to what seems most natural
looks out of place.


III

Fischer and Ravizza offer a compelling account of reasons-responsive,
actual-sequence compatibilism. On their view, the best case for
compatibilism proceeds by focusing upon the properties manifest in the
actual sequence of events leading to action. More traditional
compatibilist approaches foundered by worrying about alternative courses
of action requiring access to alternative causal sequences. This
invited forceful incompatibilist arguments (such as the well-known
consequence argument) about the incompatibility of determinism
and alternative possibilities. By offering a dispositional analysis of the
properties present in the actual causal source of an agent’s action,
Fischer and Ravizza disarm this incompatibilist strategy. All they need
to do is offer an analysis that captures an intuitively plausible basis for
the control manifested by the agent in the actual world. Locating the
pertinent control in an agent’s reasons responsiveness (via an
actual-sequence mechanism) certainly does that. This is a graceful
philosophical maneuver, painstakingly worked out in clear and provocative
detail. Their efforts have rightly placed their compatibilist view at the
center of contemporary debate.

Fischer and Ravizza's appeal to sameness of mechanism is the
lynchpin in their defense of an actual-sequence, reasons-responsive
analysis of guidance control. Regrettably, their exclusive reliance on
intuition as a basis for mechanism individuation renders their
defense of their overall theory unconvincing. There are too many
pressure points at which differing intuitions regarding sameness of
mechanism yield troubling results for their defense of guidance
control. Thus, to defend their compatibilist account of moral
responsibility fully they must address this source of trouble.

MICHAEL MCKENNA 

Bryn Mawr College 
Ithaca College 
DISJUNCTIVE PROPERTIES: MULTIPLE REALIZATIONS*


There has been much debate concerning the status of “disjunctive
properties,” and much of the debate is fueled by attacks and
defenses of nonreductive physicalism (NRP). The
most prevalent argument in support of NRP, the multiple realizability
argument, is standardly thought to stand or fall depending upon the
legitimacy of such properties. John Heil1 presents this standard line
of thought as follows:


We unhesitatingly ascribe mental states across species, despite large
differences in underlying physiology. Even within our own species, it
seems unlikely that particular mental characteristics are invariably
realized in identical neural structures.

Multiple realizability, however, need not deter a determined identity
theorist [that is, reductionist]. It is open for such a theorist, for instance,
to argue that the relevant characteristic is, in fact, disjunctive in
character. That is, it might be that, in you, mental feature M is realized in
neural structure N, whereas in an octopus, M is realized in a different
sort of neural structure N’. Would this undermine type identity? It
would not, unless we assume that M [could] not be identical with the
disjunctive characteristic <N or N’> (ibid., p. 64).


If there were legitimate disjunctive properties, the argument from
multiple realizability would not refute reductionism. Hence


* Most of the ideas for this paper developed from John Heil's National
Endowment for the Humanities Summer Seminar, “Metaphysics of Mind,” held at Cornell
University in 1996. The paper owes much to Heil and all the participants, though
Michael Watkins, Heather Gert, David Pitt, and Eric Saidel deserve special mention.
They are not, however, responsible for the faults in this paper. The paper has also
benefited from useful exchanges with Sydney Shoemaker, Alex Byrne, Sylvain
Bromberger, Jeff McConnell, and the members of the philosophy departments at
Carleton University and the University of Wisconsin/Madison.

1 The Nature of True Minds (New York: Cambridge, 1992).
advocates of NRP argue that there are no disjunctive properties, or that
they are in some way illegitimate.2 Contrary to this standard line of
thought, I shall argue that there is good reason to grant the existence
and legitimacy of disjunctive properties, and I shall sketch an account
of such properties. Yet I shall also argue that the existence and
legitimacy of disjunctive properties is compatible with a robust,
anti-eliminativist, version of NRP, a version of NRP that ought to satisfy
psychologists and philosophers of mind (and economists, biologists,
and the like).


I NRP, THE ARGUMENT FROM MULTIPLE REALIZABILITY, AND THE 
DISJUNCTIVE STRATEGY 


NRP is defined as the conjunction of two theses:


Physicalism: all particulars are constituted by physical particulars, and all
properties are realized by physical properties.3

Nonreducibility: mentalistic predicates cannot be reduced by physicalistic
predicates.


Before the argument from multiple realizability in support of NRP
can be presented, some clarifying remarks concerning these theses
are in order. To clarify the thesis of physicalism, one must specify
what it is for particulars (be they objects or events) and properties to
be physical, and one must define the relations of constitution and
realization. The issue of what it is for particulars and properties to be
physical is central and all too often overlooked, but I shall not here
address it. I shall also have nothing to say concerning the constitution
relation, as my focus will be on properties and not particulars. I shall,
however, make a significant proposal concerning the realization
relation, but for now a rough working definition of the realization
relation will suffice: a property P of an object (or event) o realizes a
property F of o if and only if (i) it is necessary that, if o instantiates P,
then o instantiates F, and (ii) o's instantiating P in some metaphysical


2 Influential defenders of NRP and critics of disjunctive properties include
Hilary Putnam, “Psychological Predicates,” in W. H. Capitan and D. D. Merrill, eds.,
Art, Mind, and Religion (Pittsburgh: University Press, 1967); Ned Block and Jerry
Fodor, “What Psychological States Are Not,” Philosophical Review, LXXXI (1972):
159-81; and Fodor, “Special Sciences,” in Representations (Cambridge: MIT, 1982),
pp. 127-45.

3 Two points. First, a weaker version of physicalism requires only that all property
instantiations be instantiated in physical particulars. I think this weaker version of
physicalism fails to capture the essence of NRP, but none of my arguments depends
upon the stronger thesis. And second, the theoretical role played by the notion of
realization in this characterization of physicalism is often played by the notion(s) of
supervenience. It will become clear in section IV why I prefer to define physicalism in
terms of realization.
sense explains o's instantiating F: being P is one way in which a thing
can be F. So, for example, the property of being 85° C which is
instantiated by my coffee cup is, it is plausible to suppose, realized by
some very discriminating microphysical property which is also
instantiated by my cup. Given that my cup has that discriminating
microphysical property, it is necessary that it also have the property of being
85° C. And that my cup has that very discriminating microphysical
property explains that my cup has the property of being 85° C: that
particular microphysical property that my cup instantiates is one of
many ways in which my cup could be 85° C. Thus to say that all
mental properties are realized by physical properties is to say that for
every instantiation of a mental property M by an object (or event) o,
there is some physical property P instantiated by o such that it is
necessary that, if o instantiates P, then o instantiates M, and o’s being
P explains, in the robust metaphysical sense, o’s being M.4
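
The working definition of realization can be set out schematically. What follows is only a sketch for orientation: the necessity operator \Box, the predicate Physical, and the two-place relation Expl (standing in for the "metaphysical explanation" clause) are notation assumed here for illustration and do not appear in the text. The use of \text assumes the amsmath package.

```latex
% Realizes(P, F, o): property P of o realizes property F of o.
\[
\mathrm{Realizes}(P, F, o) \iff
  \underbrace{\Box\,(Po \rightarrow Fo)}_{\text{(i) necessitation}}
  \;\wedge\;
  \underbrace{\mathrm{Expl}(Po,\, Fo)}_{\text{(ii) explanation}}
\]
% The claim that all mental properties are realized by physical
% properties, with M ranging over mental properties:
\[
\forall M\, \forall o\, \bigl[\, Mo \rightarrow
  \exists P\, \bigl(\mathrm{Physical}(P) \wedge Po \wedge
  \mathrm{Realizes}(P, M, o)\bigr) \bigr]
\]
```

On this rendering, clause (i) alone gives only a modal connection; clause (ii) is what makes P one of the ways in which o is F.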

As I have formulated the thesis of nonreducibility, it concerns the
nonreducibility of mentalistic predicates, and not the nonreducibility
of mental properties. In fact, strictly speaking the reduction relation
holds between theories, but, at least within the framework of Ernest
Nagel's5 classic account of reduction, a derivative notion of reduction
that holds between predicates can be specified. On Nagel's account,
very roughly, a theory T₁ is reduced to a theory T₂ just in case the laws
that make up T₁ are shown to be derivable from the laws that make
up T₂, together with a set of “bridge principles” that “connect” the
predicates of T₁ with predicates of T₂. We may take the existence and
truth of such a bridge principle that connects predicates φ and ψ to
be necessary and sufficient for the predicate reduction of φ by ψ.6
Thus, the derivative notion of predicate reduction is defined as follows:
predicate φ is reduced by predicate ψ if and only if there is a
universally quantified bridge principle that connects φ and ψ. But


4 This sketch of the realization relation is essentially the account of realization
proffered by Ernest LePore and Barry Loewer in “More on Making Mind Matter,”
Philosophical Topics, XVII, 1 (1989): 175-91.

5 See The Structure of Science (New York: Harcourt, Brace and World, 1961).
Nagel's account of theory reduction is sometimes criticized as requiring too much
for theory reduction. Thus, even if the defender of NRP is able to show that
mentalistic predicates are not Nagel-reducible to physicalistic predicates, she has
not thereby adequately defended NRP, for a weaker, more accurate, account of
theory reduction may allow for the reducibility of the mental to the physical. See,
for example, John Bickle, “Multiple Realizability and Psychophysical Reduction,”
Behavior and Philosophy, XX, 1 (1992): 47-58. See also Patricia Churchland,
Neurophilosophy (Cambridge: MIT, 1986).

6 On this account, the relation of predicate reduction is symmetric. This is not
problematic, as the relation of theory reduction is (still) asymmetric.




now, of course, we must clarify what it is for there to be a universally
quantified bridge principle that “connects” φ and ψ. What relation
must obtain between predicates φ and ψ if φ is to be connected to,
and thereby reduced by, ψ?

The classic reductionists, or “type-type identity theorists,” such as
U. T. Place and J. J. C. Smart,7 make it abundantly clear that a primary
motivation for their theory is the ontological parsimony that results
from identifying mental states with physical states. In the sense in
which reduction is relevant to the mind-body problem and the
tenability of NRP, the reduction of one theory T₁ to another theory T₂
collapses the ontological commitments of T₁ to those of T₂: if T₁ is
reduced to T₂, then one who endorses both theories is committed to
the existence of only the entities posited by T₂. Consequently, the
bridge principles by which a reduction is obtained must be
interpreted as entailing, or at least warranting the assertion of, property
identities. (At least one must assent to this if one is to be a realist
about properties and maintain that some predicates pick out, or
designate, properties.) So in asking, “What relation must obtain
between predicates φ and ψ if φ is to be reduced by ψ?” we are asking,
“What relation must obtain between two predicates if they are to be
taken to designate the same property?”

Fortunately, my purposes here do not require me to provide an
answer to the contentious question of what conditions are necessary
and jointly sufficient for predicate reduction. In order to clarify the
argument from multiple realizability in support of NRP, and the
“disjunctive strategy” in response to this argument, I need only the
noncontentious assumption that a necessary condition for the
predicate reduction of φ by ψ is that a universally quantified biconditional
of the form ∀x (φx ≡ ψx) be nomologically necessary. Whatever might
be sufficient for predicates φ and ψ to designate the same property, it
is at least necessary that such a universally quantified biconditional
hold with nomological necessity.

The argument from multiple realizability is really an argument only
in support of the thesis of nonreducibility; indeed, the argument is
sometimes formulated in such a way that it presupposes the thesis of
physicalism. The argument was first sketched by Putnam (op. cit.) and
then clarified by Fodor (op. cit.) and it has been widely discussed
since. Note that the argument from multiple realizability is alleged to
demonstrate that the nonreducibility of mentalistic predicates by


7 See Place, “Is Consciousness a Brain Process?” British Journal of Psychology, XLVII
(1956): 44-50, and Smart, “Sensations and Brain Processes,” The Philosophical Review,
LXVIII (1959): 141-56.
physicalistic predicates follows from the fact that mental properties
are multiply realized by physical properties. It is not obvious,
however, how nonreducibility, which is at least in part an epistemological
claim concerning predicates, could follow from the metaphysical claim
that mental properties are multiply realized by physical properties. A
mental property M is multiply realized by physical properties if and
only if it has some realization base of physical properties P₁, P₂, ..., Pₙ
(n > 1) where for each Pᵢ in the realization base, Pᵢ realizes M.8
Suppose that mental property M is multiply realized by properties P₁,
P₂, ..., Pₙ and that a predicate μ designates M, and finally that for each
Pᵢ there is a corresponding predicate πᵢ designating Pᵢ. Further
suppose that P₁, P₂, ..., Pₙ are all metaphysically incompatible with each
other, that is, for all objects o, o cannot instantiate both P₁ and P₂, nor
both P₂ and P₃, and so on. It follows that M is not identical to any of
the Pᵢ.9 Moreover, assuming that μ is predicate reduced by πᵢ only if
a biconditional bridge principle of the form ∀x (μx ↔ πᵢx) is (at
least) nomologically necessary, it follows that μ is not predicate
reduced by any πᵢ: each Pᵢ in M's realization base is sufficient, but not
necessary, for M. Consequently, each πᵢ is such that satisfying πᵢ is
sufficient, but not necessary, for satisfying μ, and thus no bridge
principle with the form displayed above is nomologically necessary.
So, μ cannot be predicate-reduced by any of the πᵢ. But how does it
follow, as is required to support the thesis of nonreducibility, that
mentalistic predicate μ cannot be reduced by any physicalistic
predicate? It does not follow. In order to derive the thesis of
nonreducibility an additional premise is needed, namely, that, if none of π₁,
π₂, ..., πₙ predicate reduces μ, then there is no other physicalistic


8 In the terminology of Sydney Shoemaker, P₁, P₂, and so on are total realizations
of M; see “Some Varieties of Functionalism,” Philosophical Topics, XII (1981): 83-118.
Physicalism commits one to the existence of such total realizations. Shoemaker
correctly distinguishes total realizations from what are usually taken to be
realizations, which he calls core realizations. Core realizations are physical properties that
when combined with certain structural properties realize a mental property. For example,
according to philosophical lore, the physical property undergoing C-fiber stimulation
when instantiated in a system with the structure of the human central nervous
system realizes the mental property pain. But undergoing C-fiber stimulation, when it
is instantiated in a different sort of physical structure, is not sufficient for pain. Thus,
core realizations of mental properties are, strictly speaking, not realizations of mental
properties.

9 Suppose, for reductio, that M is identical to P₁. By hypothesis, if an object o
instantiates P₂, then it instantiates M. But M is identical to P₁. So, contra the
assumption of metaphysical incompatibility, o instantiates both P₁ and P₂. So M is
not identical to P₁. Note that a more detailed statement of the argument would
require a relativization to times, as an object o might instantiate Pᵢ at one time, yet
not at another.
predicate π* that predicate reduces μ. If this additional, and rather
strong, premise is granted, then the thesis of nonreducibility follows.
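
The structure of the argument from multiple realizability can be laid out step by step. This is a reconstruction under stated assumptions rather than a formula from the text: \Box_N abbreviates nomological necessity, Red(\mu, \pi) abbreviates "\mu is predicate reduced by \pi", and \pi^{*} ranges over physicalistic predicates. All three are labels introduced here for illustration. The align* environment assumes the amsmath package.

```latex
\begin{align*}
&\text{(1) Necessary condition on reduction:}
  && \mathrm{Red}(\mu, \pi) \rightarrow
     \Box_N\, \forall x\, (\mu x \leftrightarrow \pi x) \\
&\text{(2) Each realizer is sufficient for } M\text{:}
  && \Box_N\, \forall x\, (\pi_i x \rightarrow \mu x)
     \quad (1 \le i \le n) \\
&\text{(3) No realizer is necessary for } M\ (n > 1)\text{:}
  && \neg\, \Box_N\, \forall x\, (\mu x \rightarrow \pi_i x) \\
&\text{(4) From (1) and (3):}
  && \neg\, \mathrm{Red}(\mu, \pi_i) \quad \text{for each } i \\
&\text{(5) Additional premise:}
  && \bigl(\forall i\; \neg\, \mathrm{Red}(\mu, \pi_i)\bigr) \rightarrow
     \neg\, \exists \pi^{*}\, \mathrm{Red}(\mu, \pi^{*}) \\
&\text{(6) Nonreducibility, from (4) and (5):}
  && \neg\, \exists \pi^{*}\, \mathrm{Red}(\mu, \pi^{*})
\end{align*}
```

Steps (1) through (4) rest only on multiple realization and the necessary condition on reduction; the conclusion (6) is reached only with the help of (5), which is why that premise carries the weight of the argument.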

Here is where the reductionist can employ the “disjunctive
strategy.” The reductionist can claim that mentalistic predicate μ is
reduced by the exhaustive disjunctive predicate (π₁ ∨ π₂ ∨ ... ∨ πₙ) where
each πᵢ designates Pᵢ and all the Pᵢ designated by some disjunct πᵢ
exhaust the realization base of M. In other words, the reductionist can
reject the rather strong additional premise, and maintain that there
is some physicalistic predicate that designates M, namely, the
disjunctive predicate (π₁ ∨ π₂ ∨ ... ∨ πₙ). The reductionist who employs the
disjunctive strategy thus maintains that a disjunctive bridge principle
of the form


(∨BP) ∀x [μx ↔ (π₁x ∨ π₂x ∨ ... ∨ πₙx)]10


is sufficient for the predicate reduction of μx by (π₁x ∨ π₂x ∨ ... ∨ πₙx).


II FODOR, KIM, AND THE THREAT OF RAMPANT ILLEGITIMACY 
I shall review here the debate between Fodor and Jaegwon Kim11
concerning the disjunctive strategy. I shall argue that neither Fodor's
nor Kim’s position is acceptable. Fodor argues that, despite the
necessary truth of bridge principles such as (∨BP), the reductionist’s
disjunctive strategy fails. This is because disjunctive predicates such as
(π₁x ∨ π₂x ∨ ... ∨ πₙx) fail to designate legitimate “scientific kinds,”
though mentalistic predicates such as μx do designate legitimate
“scientific kinds.”12 Fodor concludes that psychology and the other


10 The modal strength of such disjunctive bridge principles depends upon the
modal strength of the realization relation. For example, if ‘P realizes F’ entails ‘If
object or event o instantiates P, then it is metaphysically necessary that o instantiates F’,
then such disjunctive bridge principles hold with metaphysical necessity. Thus, such
disjunctive bridge principles will hold with at least nomological necessity, as
nomological necessity is, I assume, the weakest grade of modality in terms of which the
realization relation could be plausibly defined.

11 See Kim, “Multiple Realization and the Metaphysics of Reduction,” in Superve-
nience and Mind (New York: Cambridge, 1993), pp. 309-35; and Fodor.

12 A predicate might fail to designate a legitimate property, or in Fodor’s terms,
fail to determine a scientific kind, either because it fails to designate a property at
all, or because it designates a property that is in some way illegitimate or unscien-
tific. It is not clear to me what a real, yet illegitimate, property would be, and thus
I do not see what the motivation for allowing for real, yet illegitimate, properties
could be. It seems to me that the theoretical role played by a real, yet illegitimate,
property would be better played by a concept that does not correspond to a real
property. Kim seems to agree with me on this point, as he seems not to allow for
real, yet illegitimate properties (op. cit., pp. 334-35). But, unfortunately, Fodor (op.
cit.) seems to allow for real yet illegitimate properties, properties that do not
correspond with “kinds.” At any rate, the distinction between existent yet illegiti-
mate properties, and nonexistent properties is not central to my concerns, and thus
I shall ignore it as much as is possible.

“special sciences” are autonomous in the sense that the laws of these
special sciences cannot be reduced to the laws of physics. Kim re-
sponds by arguing that, given the necessary truth of (∨BP), Fodor
cannot coherently accept μx as designating a legitimate kind and
reject (π₁x ∨ π₂x ∨ … πₙx) on the grounds that it does not designate
a legitimate kind; as Kim puts it: “It is difficult to see how one could
have it both ways—that is, to castigate [(π₁x ∨ π₂x ∨ … πₙx)] as
unacceptably disjunctive while insisting on the integrity of [μx] as
[designating] a scientific kind” (op. cit., p. 324). But Kim agrees with
Fodor that (π₁x ∨ π₂x ∨ … πₙx) does not designate a legitimate kind,
and thus Kim concludes that μx also fails to designate a legitimate
kind. I shall argue that Kim is right to criticize Fodor; Fodor’s reasons
for claiming that (π₁x ∨ π₂x ∨ … πₙx) is unacceptable while μx is
acceptable are confused, and thus Kim is right to insist that the two
predicates stand or fall together. But I shall argue that the price of
agreeing with Kim that the two predicates fall together is extremely
high; it is the price of denying the legitimacy of all, or almost all,
properties. Therefore, neither Fodor’s nor Kim’s position is accept-
able.

Fodor objects to the disjunctive strategy on the grounds that dis-
junctive predicates such as (π₁x ∨ π₂x ∨ … πₙx) do not designate
“scientific kinds.” Suppose that there is a (perhaps nonstrict) law of
psychology of the form


(PL) □∀x (μ₁x → μ₂x)


where ‘□’ designates a nomological necessity operator, and μ₁x and
μ₂x are atomic mentalistic predicates. (Perhaps (PL) is a law in the
“ideal” theory of psychology.) And now consider the disjunctive
bridge principles which the reductionist utilizing the disjunctive strat-
egy claims would serve to predicate reduce μ₁x and μ₂x:


(BP1) □∀x [μ₁x ↔ (π₁x ∨ π₂x ∨ … πₙx)]
(BP2) □∀x [μ₂x ↔ (π′₁x ∨ π′₂x ∨ … π′ₘx)]


Given (BP1) and (BP2), the following physicalistic law is equivalent to
(PL):


(PL*) □∀x [(π₁x ∨ π₂x ∨ … πₙx) → (π′₁x ∨ π′₂x ∨ … π′ₘx)]


If (PL) is a law (and Fodor thinks that there are, or at least could be,
such laws), then it would seem that (PL*) is also a law. (There is no
problem substituting inside the nomological necessity operator ‘□’,
as (BP1) and (BP2) guarantee that μ₁x and (π₁x ∨ π₂x ∨ … πₙx), and
μ₂x and (π′₁x ∨ π′₂x ∨ … π′ₘx), are at least nomologically coextensive.)
But if (PL*) is a law, then the reductionist carries the day, for from
physicalistic law (PL*) together with bridge principles (BP1) and
(BP2), psychological law (PL) can be derived, and in this way psy-
chology could be reduced to a more basic physicalistic theory.
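
The claimed equivalence of (PL) and (PL*), given the two bridge principles, can be checked mechanically in a toy setting. The following Python sketch is an illustration only and rests on assumptions that are not in the text: it considers a single arbitrary object at a single arbitrary world, treats each predicate as a bare truth value, takes the two realization bases to be finite, and uses hypothetical names (`implies`, `equivalent_under_bridges`). Since the check is pointwise, it is plausible that it carries over to the quantified and necessitated forms.

```python
from itertools import product


def implies(a: bool, b: bool) -> bool:
    """Material conditional."""
    return (not a) or b


def equivalent_under_bridges(n: int = 2, m: int = 2) -> bool:
    """Return True if, in every valuation where both bridge
    biconditionals hold, (mu1 -> mu2) and (D1 -> D2) agree.

    n and m are the assumed, finite numbers of disjuncts in the
    two disjunctive predicates D1 and D2.
    """
    for values in product([False, True], repeat=2 + n + m):
        mu1, mu2 = values[0], values[1]
        first = values[2:2 + n]    # truth values of the disjuncts paired with mu1
        second = values[2 + n:]    # truth values of the disjuncts paired with mu2
        d1, d2 = any(first), any(second)
        bridges_hold = (mu1 == d1) and (mu2 == d2)  # toy (BP1) and (BP2)
        if bridges_hold and implies(mu1, mu2) != implies(d1, d2):
            return False
    return True


if __name__ == "__main__":
    print(equivalent_under_bridges())
```

The sketch only confirms the logical point that coextensive predicates can be exchanged in a conditional; it is silent on whether the resulting generalization deserves to be called a law, which is the question at issue.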

How does Fodor propose to preclude the purported (law) reduc-
tion of (PL) to (PL*)? Fodor claims that the predicates constituting
the antecedent and consequent of a law must be “kind predicates.” A
“kind predicate” for Fodor is, apparently, a previously formulated pred-
icate that appears in the previously formulated laws of some true
theory. Thus, as Fodor would have it, μ₁x and μ₂x are “kind predi-
cates” as, by assumption, they appear in previously formulated laws of
a true psychological theory. And consequently (PL) is a law, as the
predicates constituting its antecedent and consequent are “kind pred-
icates.” But, as (π₁x ∨ π₂x ∨ … πₙx) and (π′₁x ∨ π′₂x ∨ … π′ₘx) are not
previously formulated predicates that appear in a previously formu-
lated law of some true theory, they are not “kind predicates” and, as
a result, (PL*) is not a law. So psychology cannot be reduced to a
physicalistic theory by deriving laws such as (PL) from disjunctive
physicalistic “generalizations” such as (PL*).¹³

The upshot is that Fodor requires for the reduction of mentalistic
predicate μx a previously formulated physicalistic predicate that is (at
least) nomologically coextensive with μx. In other words, Fodor is
more-or-less requiring for the reduction of psychology (biology, and
the like) to physics that the theories of these special sciences turn out
to be notational variants of part of the reducing physicalistic theory.
The chances of this sort of correlation obtaining between theories in
the special sciences and physics are, as Fodor suggests, slim. But this
ought not console the defender of NRP, for Fodor’s requirements for
reduction are extremely strong, and go far beyond anything required


13 Fodor also objects to the “lawhood” of (PL*) on the grounds that “‘it’s a law
that ____’ defines a non truth functional context” (op. cit., p. 140). Fodor argues as
follows: “one may not argue from ‘it’s a law that P brings about R’ and ‘it’s a law
that Q brings about S’ to ‘it’s a law that P or Q brings about R or S’. [F]or
example, it is a law that the irradiation of green plants by sunlight causes carbo-
hydrate synthesis, and it is a law that friction causes heat, but [it is not] a law that
(either the irradiation of green plants by sunlight or friction) causes (either
carbohydrate synthesis or heat)” (ibid., p. 140).

Fodor is correct that “‘it’s a law that ___’ defines a non truth functional context,”
and thus the general inference Fodor describes is invalid. But this does not block the
disjunctive strategy. In order to block the disjunctive strategy, Fodor must do more than
show that the inference from ‘it’s a law that P brings about R’ and ‘it’s a law that Q
brings about S’ to ‘it’s a law that P or Q brings about R or S’ is in general invalid. To
block the disjunctive strategy, he must show that this inference is invalid when P and Q
exhaust the realization base of a property M₁, and R and S exhaust the realization base
of M₂, and it is a proper law that M₁ brings about M₂. Fodor has given us no reason to
be suspicious of the inference in this very special circumstance.

by Nagel’s classic model of reduction, or any other model of reduc-
tion I know of.¹⁴ Moreover, there are several independent reasons for
rejecting Fodor’s requirement that for μx to be reduced, it must be
reduced by a previously formulated physicalistic predicate.

First, as was explained above, the issue of predicate reduction is—at
least in the context of the mind-body problem—really the issue of
property identity. If reduction is to have the ontological conse-
quences that the type-type identity theorist desires and the defender
of NRP fears, then reducing bridge principles must entail, or at least
warrant the assertion of, property identities. Thus, what matters for
predicate reduction is what the predicates designate, not that they
have been previously formulated. So Fodor cannot reject the disjunc-
tive predicate (π₁x ∨ π₂x ∨ … πₙx) merely on the grounds that it has
not previously been used to state scientific laws. The familiarity of the
predicate is irrelevant; what is relevant is what property, if any, the
predicate designates.

At this point, one might object that my interpretation of Fodor’s
reason for rejecting (π₁x ∨ π₂x ∨ … πₙx) is uncharitable. For perhaps
Fodor rejects such disjunctive predicates not merely because they
have not previously been used to state scientific laws, but rather
because such predicates fail to designate legitimate properties or
“natural kinds.” This objection has some merit; in some passages, it
seems that this is what Fodor has in mind.¹⁵ The problem with this
interpretation of Fodor, however, is that it plays directly into the
hands of Kim’s response:


14 Kim also criticizes Fodor on the grounds that he requires too much for
reduction (see op. cit., footnote 21). In footnote 2 of “Special Sciences,” Fodor
claims that he is working with what he takes to be the “classical form” of reduction,
though he also admits that it is “a stronger one than many philosophers of science
hold.” He also claims that his argument against reductionism would hold against
even “liberalized versions” of reduction. I take myself to have demonstrated that
this latter claim is false.

15 Fodor introduces the phrase ‘kind predicate’ with regard to “(natural) kinds”
(op. cit., p. 132), thus suggesting that a predicate is an acceptable “kind predicate”
if and only if it designates, or “determines,” a “natural kind,” where the criteria for
being a natural kind are independent of any linguistic entity such as a science
and/or theory. Moreover, he seems to think that disjunctive predicates do not
designate “natural kinds”; at one point Fodor states, “I doubt that ‘is either carbo-
hydrate synthesis or heat’ is plausibly taken to be a kind predicate” (op. cit., p. 140).
But Fodor later drops the phrase ‘natural kind’ and rejects the reductionist’s appeal
to the disjunctive predicate (π₁x ∨ π₂x ∨ … πₙx) on the grounds that it “is not a kind
predicate in the reducing science” (op. cit., p. 138). In other words, Fodor seems to
begin with the plausible requirement for reduction that the predicates in a bridge
principle must designate “natural kinds,” but then slides to the implausibly strong
requirement that the predicates in a bridge principle be previously formulated
predicates used to state the laws of the reducing science and/or theory.


If pain is nomically equivalent to N, the property claimed to be wildly
disjunctive and obviously nonnomic, why isn’t pain itself equally heteroge-
neous and nonnomic as a kind? Why isn’t pain’s relationship to its real-
ization bases, Nₕ, Nᵣ, and Nₘ, analogous to jade’s relationship to jadeite
and nephrite? If jade turns out to be nonnomic on account of its dual
“realizations” in distinct microstructures, why doesn’t the same fate
befall pain? After all, the group of actual and nomologically possible
realizations of pain, as they are described by the MR enthusiasts with
such imagination, is far more motley than the two chemical kinds
comprising jade (op. cit., p. 323).


Kim’s criticism, then, is this: if Fodor is interpreted as advancing the
plausible requirement for reduction that the predicates in a bridge
principle designate legitimate properties, or “natural kinds,” then he
cannot reasonably assert that mentalistic predicate μx designates a
natural kind, while denying that the exhaustive disjunctive predicate
(π₁x ∨ π₂x ∨ … πₙx) designates a natural kind. For, given the neces-
sary truth of


(∨BP) ∀x [μx ↔ (π₁x ∨ π₂x ∨ … πₙx)]


if (π₁x ∨ π₂x ∨ … πₙx) fails to designate, or determine, a natural kind,
then so does μx. Moreover, Kim argues that there is good reason to
deny that heterogeneous disjunctive predicates such as (π₁x ∨ π₂x
∨ … πₙx) designate natural kinds. Kim does not object to such
predicates merely because they are unfamiliar; rather, he objects to
both μx and (π₁x ∨ π₂x ∨ … πₙx) for semantic and metaphysical
reasons: “Given that mental kinds are realized by diverse physical
causal kinds, it follows that mental kinds are not causal kinds, and
hence are disqualified as proper scientific kinds. Each mental kind is
sundered into as many kinds as there are physical realization bases for
it, and psychology as a science with disciplinary unity turns out to be
an impossible project” (op. cit., p. 327).

In the next section, I defend the legitimacy of “disjunctive proper-
ties” from arguments such as Kim’s. Here, I wish only to motivate my
defense by highlighting an unsavory consequence of Kim’s argument.
It is clear that his argument against the legitimacy of mental proper-
ties generalizes into an argument against the legitimacy of all multiply
realized properties; it is irrelevant to the argument that M is a mental
property—all that matters is that M is multiply realized. For every
multiply realized property designated by some predicate ρx there is,
or there could be, a corresponding exhaustive disjunctive predicate
(π₁x ∨ π₂x ∨ … πₙx) that is at least nomologically coextensive with ρx.
So, if no disjunctive predicate of the form (π₁x ∨ π₂x ∨ … πₙx)
determines a “scientific kind” or designates a legitimate property,
then no predicate such as ρx designates a legitimate property. But
this is very problematic because all, or almost all, properties are
multiply realized. If the fact that many different sorts of creatures can
instantiate being in pain leads us to believe that being in pain is multiply
realized, then the fact that many different sorts of things can instan-
tiate, for example, being green ought to lead us to believe that being
green is multiply realized. The point also holds for paradigmatic
physical properties. Consider the paradigmatic physical property of
having a mass of two grams. This property is instantiated by many
different kinds of objects—bits of paper, bone, metal, jelly, and so on.

Moreover, we expect that, if an object (or event) o instantiates a
property P, then there is some noncausal explanation as to why o
instantiates P; there must be something about o in virtue of which o
instantiates P. That is, there must be some property R of o such that
R realizes P. (So much is required by the “principle of sufficient
reason,” at least under one formulation of that principle.) For exam-
ple, if a bit of plastic has a mass of two grams, then it instantiates that
property in virtue of being composed of a certain number of certain
sorts of molecules structured in a certain way. And if some very
different sort of object o′, say a bit of bone, also has a mass of two
grams, then o′ instantiates this property in virtue of being composed
of a certain number of certain other sorts of molecules structured
in some other way. Claiming that a property P is not multiply realized
is claiming that there are no properties in virtue of which objects
instantiate P; it is tantamount to claiming that when an object instan-
tiates P it is just a “brute fact” that it does so. But most philosophers
are resistant to positing brute, unexplainable, facts, and to the extent
that they are resistant to brute facts, they ought to be accepting of the
claim that most if not all properties are multiply realized.¹⁶


16 One might object to the thesis that most properties are multiply realized on the
grounds that only “higher-order” properties are multiply realized. This objection,
however, fails to preclude rampant illegitimacy because, as the notion of a “higher-
order property” is usually defined, almost all properties are higher-order properties.
Louise Antony and Joseph Levine define a higher-order property as “a property you
have in virtue of having some other property that meets certain specifications”—see
“Reduction with Autonomy,” Philosophical Perspectives, XI (1997): 83-105, here p. 85.
The problem is that almost every property is associated with some sort of causal/
functional role, and thus meets this criterion. Similarly, one can apply the Ramsey/
Lewis method to define almost any predicate. For similar accounts of functional/
higher-order properties, see Kim, “The Mind-Body Problem: Taking Stock After
Forty Years,” Philosophical Perspectives, XI (1997): 185-205; Block, “Can the Mind
Change the World?” in George Boolos, ed., Meaning and Method: Essays in Honor of
Hilary Putnam (New York: Cambridge, 1990), pp. 137-70; and Putnam.

If the above considerations are correct and thus most, if not all,
properties are multiply realized, then the advocate of NRP has a
serious problem. For if, following Fodor, she rejects the reductionist’s
appeal to exhaustive disjunctive predicates on the grounds that such
predicates do not designate legitimate properties, then she is evi-
dently required to deny the legitimacy of all multiply realized prop-
erties. The price of rejecting the reductionist’s appeal to “disjunctive
properties” is rampant illegitimacy. The entire argument is explicitly
formulated as follows:


(1) For each multiply realized property R designated by predicate ρx,
there is an exhaustive disjunctive predicate (π₁x ∨ π₂x ∨ … πₙx)
such that each πᵢx designates Pᵢ, where Pᵢ realizes R, and (π₁x ∨ π₂x
∨ … πₙx) contains one disjunct for each property that realizes R.¹⁷

(2) ρx is at least nomologically coextensive with (π₁x ∨ π₂x ∨ … πₙx).


But since


(3) If ρx and (π₁x ∨ π₂x ∨ … πₙx) are (at least) nomologically coexten-
sive, then ρx designates a legitimate property only if (π₁x ∨ π₂x
∨ … πₙx) designates a legitimate property,


it follows that


(4) ρx designates a legitimate property only if (π₁x ∨ π₂x ∨ … πₙx)
designates a legitimate property.


But in order to undermine the disjunctive strategy and thereby avoid
reductionism, the defender of NRP has claimed, following Putnam
and Fodor, that


(5) Such heterogeneous disjunctive predicates as (π₁x ∨ π₂x ∨ … πₙx)
do not designate legitimate properties.


Therefore


(6) ρx does not designate a legitimate property.


But


(7) All, or almost all, properties are multiply realized.


And consequently


(8) Few, if any, of our predicates designate legitimate properties.


17 This premise presupposes that all the properties in R’s realization base can be
designated by predicates. This assumption probably must be rejected by anyone
who is a realist about properties, and this suggests an alternative response to the
argument. Related issues will be discussed in section IV.

The conclusion, (8), is unacceptable, and the argument is valid.
Moreover, I have argued that the only dubious premise is (5). Thus
there is cogent reason for hoping that at least some exhaustive
disjunctive predicates such as (π₁x ∨ π₂x ∨ … πₙx) do designate
legitimate properties (or at least they would if they were formulated);
the price of denying this is that of accepting the unsavory conclusion
that all, or almost all, properties are illegitimate. If rampant illegiti-
macy is to be avoided, an acceptable account of disjunctive properties
must be formulated.


III. DEVELOPMENT OF A CAUSAL POWER ACCOUNT OF DISJUNCTIVE
PROPERTIES


Before an acceptable account of disjunctive properties can be formu-
lated, it must be made clear what it would be for a property to be a
disjunctive property. I have been, and shall continue to be, assuming
a naive realism concerning properties: properties are mind-indepen-
dent entities, at least some of which are designated by predicates. A
disjunctive property, one quickly thinks, is what one gets when one
takes property P and property Q and then forms their disjunction.
But this account involves a use/mention confusion: one can form the
disjunction of predicate φ and predicate ψ, namely, ⌜φ ∨ ψ⌝, but one
cannot literally form a disjunction of properties any more than one
could form a disjunction of, say, particular people.¹⁸ Let us define
the notion of a properly disjunctive predicate: a disjunctive predicate
(π₁ ∨ π₂ ∨ … πₙ) is a properly disjunctive predicate if and only if (i)
there is more than one disjunct πᵢ, (ii) each disjunct πᵢ designates a
legitimate property, and (iii) each πᵢ designates a distinct property.
The notion of a disjunctive property is now defined as follows: P is a
disjunctive property if and only if P can be designated by a properly
disjunctive predicate. (In terms of the lambda calculus, P is a disjunc-


18 In a puzzling passage in “Concepts of Supervenience,” in Supervenience and
Mind, Kim eschews infinite disjunctive predicates, yet endorses the “operation” of
disjunction applied to infinitely many properties: “such operations as infinite
conjunctions and infinite disjunctions would be highly questionable for predicates,
but not necessarily for properties—any more than infinite unions and intersections
are for classes. The property of being less than one meter long can be thought of
as an infinite disjunction (e.g., of all properties of the form being less than n/n+1
meters long, for every natural number n)” (p. 73).

I share Kim’s uneasiness concerning infinite predicates. But, while it is relatively
clear what a disjunction of predicates is, it is not at all clear what a disjunction of
properties is.

tive property if and only if for some predicates φ and ψ designating
distinct legitimate properties, P is designated by ⌜λx(φx or ψx or …)⌝.)¹⁹
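
Conditions (i)-(iii) on a properly disjunctive predicate can be stated as a small check. The Python sketch below is an illustration under stated assumptions, none of which come from the text: predicates are represented by strings, the designation relation by a hypothetical dictionary `designates` (with `None` for a predicate that designates nothing), and the set `legitimate` of legitimate properties is simply supplied as input, since which properties are legitimate is exactly what is in dispute.

```python
from typing import Dict, Hashable, Iterable, Optional, Set


def is_properly_disjunctive(
    disjuncts: Iterable[str],
    designates: Dict[str, Optional[Hashable]],
    legitimate: Set[Hashable],
) -> bool:
    """Check conditions (i)-(iii) for a disjunctive predicate.

    disjuncts  -- the disjuncts of the predicate, as strings
    designates -- assumed map from a disjunct to the property it
                  designates, or None if it designates nothing
    legitimate -- assumed set of legitimate properties
    """
    disjuncts = list(disjuncts)
    props = [designates.get(d) for d in disjuncts]
    # (i) there is more than one disjunct
    more_than_one = len(disjuncts) > 1
    # (ii) each disjunct designates a legitimate property
    all_legitimate = all(p is not None and p in legitimate for p in props)
    # (iii) each disjunct designates a distinct property
    all_distinct = len(set(props)) == len(props)
    return more_than_one and all_legitimate and all_distinct


if __name__ == "__main__":
    designates = {"pi1": "P1", "pi2": "P2", "pi1_alias": "P1", "empty": None}
    legitimate = {"P1", "P2"}
    print(is_properly_disjunctive(["pi1", "pi2"], designates, legitimate))
    print(is_properly_disjunctive(["pi1", "pi1_alias"], designates, legitimate))
```

On this rendering, whether a predicate is properly disjunctive depends only on its disjuncts; whether the whole predicate designates a legitimate property is a further question, which the check deliberately leaves open.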

Thus the nonreductionist’s claim that there are no disjunctive
properties (premise (5) in the above argument) is interpreted to
mean that properly disjunctive predicates do not designate legitimate
properties; such predicates are either “empty” or designate illegiti-
mate, “unscientific,” properties. Note that the exhaustive disjunctive
predicates by which the reductionist proposes reducing mentalistic
predicates are properly disjunctive predicates; they satisfy conditions
(i)-(iii) above. Consequently, since all or almost all properties are
multiply realized, the nonreductionist’s claim that there are no legit-
imate disjunctive properties entails rampant illegitimacy.

My strategy for formulating an acceptable account of disjunctive
properties will be to explicate and criticize two of David Arm-
strong’s²⁰ influential arguments against the legitimacy of disjunctive
properties. These negative arguments serve as constraints on an
acceptable account of disjunctive properties in the sense that—at the
cost of rampant illegitimacy—an acceptable account of disjunctive
properties must allow at least some disjunctive properties to survive
these negative arguments. Thus, in responding to these arguments, I


19 One could also provide a nonmetalinguistic account of disjunctive properties as
follows: let properties be functions from possible worlds to sets of entities in those
worlds; that is, property P is a function f( ) such that for worlds w, f(w) = {x: x has
P in w}. The disjunction f₃( ) of properties f₁( ) and f₂( ) is defined as follows: for
all w, f₃(w) = f₁(w) ∪ f₂(w). A property could then be defined as a disjunctive
property as follows: property P is disjunctive if and only if there are properties Q and
R such that P is the disjunction of Q and R. To avoid the result that all properties
are disjunctive, various restrictions could be placed on Q and R; for example, it
might be required that Q and R be “natural” or “basic” properties.

The problem with this account of disjunctive properties, aside from that of
specifying the “natural” properties, is that it is not clear how to relate it to the debate
surrounding the argument from multiple realizability. Note that prima facie there
is no reason to suppose that disjunctive predicates such as (π₁x ∨ π₂x ∨ … πₙx)
designate disjunctive properties in this sense. Nor is there reason to suppose that
atomic predicates do not designate such disjunctive properties.
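
The nonmetalinguistic account just described can be sketched in a few lines of Python. This is a toy rendering only, under assumptions not found in the text: worlds and entities are strings, there are finitely many of each, a property is a dictionary from worlds to sets of entities, and the names `Property`, `disjunction`, and `is_disjunction_of` are hypothetical.

```python
from typing import Dict, FrozenSet

# A property, on this toy model: world name -> entities that have it there.
Property = Dict[str, FrozenSet[str]]


def disjunction(f1: Property, f2: Property) -> Property:
    """Pointwise union: f3(w) = f1(w) ∪ f2(w) for every world w."""
    worlds = set(f1) | set(f2)
    return {w: f1.get(w, frozenset()) | f2.get(w, frozenset()) for w in worlds}


def is_disjunction_of(p: Property, q: Property, r: Property) -> bool:
    """True if p has, at every world, exactly the extension of q-or-r."""
    worlds = set(p) | set(q) | set(r)
    d = disjunction(q, r)
    return all(p.get(w, frozenset()) == d.get(w, frozenset()) for w in worlds)


if __name__ == "__main__":
    red = {"w1": frozenset({"a"}), "w2": frozenset()}
    blue = {"w1": frozenset({"b"}), "w2": frozenset({"a"})}
    red_or_blue = disjunction(red, blue)
    print(is_disjunction_of(red_or_blue, red, blue))
    # Without restrictions on Q and R, every property counts as disjunctive:
    print(is_disjunction_of(red, red, red))
```

The second call illustrates why some restriction on Q and R is needed: any property is trivially the disjunction of itself with itself.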

20 A Theory of Universals: Universals and Scientific Realism, Volume II (New York:
Cambridge, 1978). For other arguments against disjunctive properties, see D. H.
Mellor, “Predicates and Properties,” in Mellor and Alex Oliver, eds., Properties (New
York: Cambridge, 1997); David Owens, “Disjunctive Laws,” Analysis, XLIX (1989):
197-202; and Kim, “Multiple Realization and the Metaphysics of Reduction,” in
Supervenience and Mind, pp. 309-35. Mellor’s argument is successfully criticized by
Andrew Botterell in “Mellor on Negative Properties,” The Philosophical Quarterly,
XLVIII, 193 (1998): 523-26. Owens and Kim’s argument is successfully criticized by
Fodor in “Special Sciences: Still Autonomous After All These Years,” in In Critical
Condition: Polemical Essays on Cognitive Science and the Philosophy of Mind (Cambridge:
MIT, 1998), pp. 11-12.

shall be taking significant steps toward the development of an accept-
able account of disjunctive properties.

Armstrong’s first argument against disjunctive properties is this:


disjunctive properties offend against the principle that a genuine prop-
erty is identical in its different particulars. Suppose a has a property P,
but lacks Q, while b has Q but lacks P. It seems laughable to conclude
from these premises that a and b are identical in some respect. Yet both
have the “property”, P or Q (op. cit., p. 20).


This argument seems persuasive when applied to the sort of ex-
tremely “heterogeneous” disjunctive predicate Armstrong had in
mind. Consider his favorite example: ‘is a raven or a writing desk’. It
would be at least odd to say that the entities in the extension of ‘is a
raven or a writing desk’ all resemble each other, or “are identical in
some respect.”²¹ Thus the argument shows that not all properly
disjunctive predicates designate properties. But it by no means fol-
lows that no properly disjunctive predicates designate properties.
Consider, for example, the (perhaps infinite) disjunctive predicate
(χ₁x ∨ χ₂x ∨ … χₙx), where each disjunct χᵢx designates a distinct color
property, Cᵢ, and every color is designated by some χᵢ. (Thus (χ₁x ∨
χ₂x ∨ … χₙx) is something like ‘is either red or blue or green or yellow
or orange’.) This predicate satisfies requirements (i)-(iii) for being
a properly disjunctive predicate. Thus, if it designates a property, then, by
the above definition of disjunctive property, it designates a disjunctive
property. The property being blue (or being a very particular shade of
blue) is one of the Cᵢ, and being blue is a determinate of the determinable
property being colored—being blue is one of the ways in which a thing
can be colored, and if a thing is blue, then it must be colored. To put
the point in more relevant terminology, being blue realizes being colored;
more generally, determinables with more than one determinate are one
sort of multiply realized property.²² Consequently, because the prop-
erly disjunctive predicate (χ₁x ∨ χ₂x ∨ … χₙx) is exhaustive, it is nec-
essarily true that it is satisfied by all and only colored objects.
Therefore, assuming that the entities that instantiate being colored all


21 It would not, however, be false to say this. For example, all ravens and all
writing desks have mass, and thus are identical in this respect. But it suits my
explanatory purposes to grant this point to Armstrong, at least with regard to his
first argument.

22 The isomorphism between realized and realizing properties, on the one hand,
and determinables and determinates, on the other, was, I believe, first noticed by
Stephen Yablo, who utilizes the isomorphism to explain the causal relevance of
mental events—“Mental Causation,” The Philosophical Review, CI (1992): 245-80. My
views concerning disjunctive properties are significantly influenced by Yablo’s
insights.

resemble each other in some respect, there is a respect in which all
objects in the extension of (χ₁x ∨ χ₂x ∨ … χₙx) resemble each other,
namely, being colored.

The same response, mutatis mutandis, illustrates that the exhaus-
tive properly disjunctive predicate (π₁x ∨ π₂x ∨ … πₙx), by which the
reductionist proposes reducing mentalistic predicate μx, also serves
as a counterexample to Armstrong’s first argument. For μx and (π₁x
∨ π₂x ∨ … πₙx) are also necessarily coextensive. Consequently, there
is a “respect,” namely, M, in which all the entities in the extension of
(π₁x ∨ π₂x ∨ … πₙx) resemble each other. As is required to steer
clear of rampant illegitimacy, this most relevant case serves as a
counterexample to Armstrong’s first argument against disjunctive
properties, and thus his argument does not demonstrate that all
properly disjunctive predicates fail to designate legitimate properties.

Although Armstrong’s first argument fails to support the nonre-
ductionist’s rejection of disjunctive properties, it does serve to high-
light a significant constraint on disjunctive properties. His first
argument fails to demonstrate that (π₁x ∨ π₂x ∨ … πₙx) does not designate a
legitimate property because the πᵢx which are disjoined to form (π₁x
∨ π₂x ∨ … πₙx) overlap on a property. Consider again all specific color
predicates and the determinable being colored. All specific color pred-
icates can be said to overlap on the property being colored because an
object that satisfies any one of the specific color predicates must
instantiate being colored. Generalizing now, the disjuncts of a disjunc-
tive predicate can be said to overlap if and only if there is some
property R such that every possible object (or event) that satisfies any
of the disjuncts must instantiate R. It is because its disjuncts overlap
on M that (π₁x ∨ π₂x ∨ … πₙx) survives Armstrong’s first argument:
because the disjuncts overlap on M, there is a “respect” in which all
the objects (or events) in the extension of (π₁x ∨ π₂x ∨ … πₙx)
resemble each other, namely, M. Although Armstrong’s first argu-
ment does not succeed in demonstrating that there are no disjunctive
properties, it does succeed in demonstrating that a necessary condi-
tion for a properly disjunctive predicate’s designating a legitimate
property is that the disjuncts of the predicate overlap on a property.
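
The overlap condition can be made concrete with a small sketch. The Python below is illustrative only and assumes things the text does not supply: a finite domain of named objects standing in for "every possible object," extensions given as sets, and a hand-picked inventory of candidate properties. The function name `overlap_properties` is hypothetical.

```python
from typing import Dict, Set


def overlap_properties(
    disjunct_extensions: Dict[str, Set[str]],
    property_extensions: Dict[str, Set[str]],
) -> Set[str]:
    """Return the candidate properties R on which the disjuncts overlap,
    i.e. those instantiated by every object satisfying any disjunct."""
    satisfiers: Set[str] = set().union(*disjunct_extensions.values())
    return {
        name
        for name, instances in property_extensions.items()
        if satisfiers <= instances
    }


if __name__ == "__main__":
    # Toy inventory of candidate properties (an assumption of the sketch).
    properties = {
        "being colored": {"apple", "sky", "leaf"},
        "being a raven": {"raven1"},
        "being a writing desk": {"desk1"},
    }
    colors = {"is red": {"apple"}, "is blue": {"sky"}, "is green": {"leaf"}}
    raven_or_desk = {"is a raven": {"raven1"}, "is a writing desk": {"desk1"}}
    print(overlap_properties(colors, properties))
    print(overlap_properties(raven_or_desk, properties))
```

In this toy inventory the color predicates overlap on being colored, while the raven-or-writing-desk disjuncts overlap on nothing. Adding a property such as having mass to the inventory would change the second result, which is why overlap is offered here only as a necessary condition.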

Armstrong’s second argument against disjunctive properties is
this:


the postulation of disjunctive properties breaks the link which it is
natural to make between properties of things and causal powers of
things. Suppose that a has P but lacks Q. The predicate ‘P∨Q’ applies
to a. Nevertheless, when a acts, it will surely act only in virtue of its being
P. Its being P or Q will add no power to its arm. This suggests that being
P or Q is not a property (op. cit., p. 20).²³


This argument presupposes a causal power conception of properties,
the essentials of which Armstrong describes in this passage:


(a) The active and passive powers of particulars are determined by their
properties. (b) Every property bestows some active and/or passive pow-
er[s] upon the particulars of which it is a property. (c) A property
bestows the very same causal power[s] upon any particular of which it is
a property. (d) Each different property bestows a different [set of]
power[s] on the particulars of which it is a property (op. cit., pp. 43-44).


(I have amended Armstrong’s conception slightly to allow properties
to bestow nonempty sets of causal powers rather than just single
causal powers. It is not clear whether or not Armstrong would accept
this amendment.²⁴)

Armstrong does not extend this conception of properties as be-
stowing causal powers to an account of the conditions under which a
predicate designating a legitimate property is satisfied by an object,
but it is relatively clear how the conception ought to be so extended.
Legitimate properties bestow causal powers on the objects that in-
stantiate them; an object o instantiates a legitimate property P if and
only if o possesses every causal power bestowed by P. It will simplify
matters if, instead of speaking of properties “bestowing” causal pow-
ers, properties are simply identified with sets of causal powers. Thus,
I shall sometimes speak of a property being constituted by a set of
causal powers. The causal power model of properties together with
this simplifying assumption implies the following necessary and
jointly sufficient conditions for a predicate’s designating a property:


Predicate π designates a property P if and only if there is some nonempty
set of causal powers p such that (a) if a particular o satisfies π, then o
possesses every power in p, and the converse, (b) if a particular o
possesses every power in p, then o satisfies π.
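
The two conditions can be made concrete with a small finite model. The following is only an illustrative sketch, not part of the original argument: it assumes a toy universe in which each object is listed with the causal powers it possesses, and all object names, power names, and function names (`meets_a`, `meets_b`, `designates_property`) are hypothetical.

```python
from typing import Callable, Dict, FrozenSet

Powers = FrozenSet[str]
Universe = Dict[str, Powers]        # object name -> causal powers it possesses
Predicate = Callable[[str], bool]   # True if the named object satisfies the predicate


def meets_a(satisfies: Predicate, d: Powers, universe: Universe) -> bool:
    """Condition (a): every object satisfying the predicate has every power in d."""
    return all(d <= powers for name, powers in universe.items() if satisfies(name))


def meets_b(satisfies: Predicate, d: Powers, universe: Universe) -> bool:
    """Condition (b): every object having every power in d satisfies the predicate."""
    return all(satisfies(name) for name, powers in universe.items() if d <= powers)


def designates_property(satisfies: Predicate, d: Powers, universe: Universe) -> bool:
    """The set d must be nonempty and meet both conditions."""
    return len(d) > 0 and meets_a(satisfies, d, universe) and meets_b(satisfies, d, universe)


# Hypothetical toy universe (an assumption made purely for illustration).
universe: Universe = {
    "bar_magnet": frozenset({"has_mass", "attracts_iron"}),
    "lodestone": frozenset({"has_mass", "attracts_iron"}),
    "cork": frozenset({"has_mass", "floats"}),
}


def is_magnet(name: str) -> bool:
    # The predicate is given by its extension in the toy universe.
    return name in {"bar_magnet", "lodestone"}


good = designates_property(is_magnet, frozenset({"attracts_iron"}), universe)
too_weak = designates_property(is_magnet, frozenset({"has_mass"}), universe)
```

In this toy model the set containing only the hypothetical power `attracts_iron` meets both conditions, whereas the set containing only `has_mass` meets (a) but fails (b), since the cork has mass without satisfying the predicate.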


Armstrong's second argument again is cogent when applied to
extremely heterogeneous properly disjunctive predicates such as his
example, ‘is a raven or a writing desk’. The problem is that this


23 This is actually Armstrong's third argument against disjunctive properties, but
his second argument is neither successful nor relevant to my purposes.

24 These amendments bring Armstrong's causal power model of properties more
in line with the model proposed by Shoemaker; see “Causality and Properties,” in
Peter van Inwagen, ed., Time and Cause (Boston: Reidel, 1980), pp. 229-54, and
“Causal and Metaphysical Necessity,” Pacific Philosophical Quarterly, LXXIX (1998):
59-77. My views on these matters owe much to Shoemaker's work.
predicate does not meet conditions (a) and (b). That is, there is no
set of causal powers d such that (a) every particular that satisfies ‘is a
raven or a writing desk’ possesses every causal power in d, and (b)
every particular that possesses every causal power in d satisfies ‘is a
raven or a writing desk’. Let r be the set of causal powers associated
with the predicate ‘is a raven’, and w be the set of causal powers
associated with ‘is a writing desk’. The set of causal powers d cannot be
identified with r, because writing desks satisfy ‘is a raven or a writing
desk’ yet do not possess all the causal powers in r. Thus condition (a)
is violated. Nor can d be identified with w, for ravens satisfy ‘is a raven
or a writing desk’ yet do not possess all the causal powers in w. Thus
condition (a) is again violated. And a fortiori d cannot be identified
with r∪w, because many ravens satisfy ‘is either a raven or a writing
desk’ but do not possess all the powers in r∪w, and many writing
desks also satisfy ‘is a raven or a writing desk’ but do not possess all
the powers in r∪w. Clearly, then, in order to satisfy condition (a), d
must be a subset of both r and w. Perhaps then d can be identified with
r∩w. Note that it is unlikely that r∩w is the empty set. For there are
some causal powers shared by all possible ravens and writing desks. For
example, surely every possible raven or writing desk has mass, and
having mass is a property, and thus everything that instantiates this
property must possess every member of a nonempty set of causal
powers. So r∩w is not the empty set. But nonetheless d cannot be
identified with r∩w. The problem is that it is likely that there are
many possible things that possess all the causal powers in r∩w, but do
not satisfy the predicate ‘is either a raven or a writing desk’. If an
object o possesses all the causal powers that all ravens and writing
desks have in common, it does not follow that o is either a raven or o
is a writing desk. There are, for example, many things that possess all
the causal powers bestowed by the property having mass that are
neither ravens nor writing desks. And thus d cannot be identified
with r∩w on pain of condition (b) being violated. Moreover, and a
fortiori, d cannot be identified with any proper subset of r∩w. If
there are objects that possess every causal power in r∩w yet do not
satisfy ‘is either a raven or a writing desk’, then for any proper subset
of r∩w there will be objects that possess every causal power in it, but
do not satisfy ‘is either a raven or a writing desk’.
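
The case analysis above can be replayed on a toy model. This is a hedged sketch under invented assumptions: the three objects and the power names (`has_mass`, `flies`, `supports_writing`) are hypothetical stand-ins, chosen only so that ravens and writing desks share a single power that a third object also possesses.

```python
# Hypothetical sets of causal powers associated with each disjunct.
r = frozenset({"has_mass", "flies"})             # 'is a raven'
w = frozenset({"has_mass", "supports_writing"})  # 'is a writing desk'

# Toy universe: object name -> causal powers it possesses (illustrative only).
universe = {
    "raven_1": r,
    "desk_1": w,
    "rock_1": frozenset({"has_mass"}),
}


def raven_or_desk(name: str) -> bool:
    """Extension of 'is a raven or a writing desk' in the toy universe."""
    return name in {"raven_1", "desk_1"}


def meets_a(d: frozenset) -> bool:
    """Condition (a): every satisfier of the predicate has every power in d."""
    return all(d <= powers for name, powers in universe.items() if raven_or_desk(name))


def meets_b(d: frozenset) -> bool:
    """Condition (b): everything with every power in d satisfies the predicate."""
    return all(raven_or_desk(name) for name, powers in universe.items() if d <= powers)


# Each candidate for d, mapped to (meets condition (a), meets condition (b)).
candidates = {"r": r, "w": w, "r_union_w": r | w, "r_intersect_w": r & w}
results = {label: (meets_a(d), meets_b(d)) for label, d in candidates.items()}
```

In this model r, w, and their union each fail condition (a), while the intersection passes (a) but fails (b) because the rock has mass without being a raven or a writing desk. The only proper subset of the intersection here is the empty set, which the nonemptiness requirement already excludes.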

So Armstrong's second argument, like his first argument, succeeds
in demonstrating that ‘is either a raven or a writing desk’ fails to
designate a legitimate property. But his second argument, again like
his first argument, does not demonstrate that no properly disjunctive
predicate designates a property. Consider again the exhaustive
disjunction of color predicates, (χ₁x ∨ χ₂x ∨ … ∨ χₙx), where each disjunct
χᵢx designates a distinct color property, Cᵢ, and each such property Cᵢ
is constituted by a set of causal powers cᵢ. If (χ₁x ∨ χ₂x ∨ … ∨ χₙx)
designates a legitimate property, there must be some nonempty set of
causal powers d′ such that (a) every possible thing that satisfies
(χ₁x ∨ χ₂x ∨ … ∨ χₙx) possesses every causal power in d′, and (b) every
possible thing that possesses every causal power in d′ satisfies
(χ₁x ∨ χ₂x ∨ … ∨ χₙx). Is there a set of causal powers that meets
conditions (a) and (b)? In this case, there is good reason to suppose
that there is. Specifically, there is good reason to suppose that the
intersection of all the sets of causal powers constituting the properties
designated by the disjuncts, namely, c₁∩c₂∩…∩cₙ, meets both conditions.

First, assuming the causal power model of properties, it is
incontestable that c₁∩c₂∩…∩cₙ meets condition (a). Suppose an object o
satisfies (χ₁x ∨ χ₂x ∨ … ∨ χₙx). Then o must satisfy one of the disjuncts
χᵢx. But if o satisfies one of the disjuncts χᵢx, then it must instantiate
the property Cᵢ designated by that disjunct, and thus it must possess
every causal power in the set of causal powers cᵢ that constitute Cᵢ. But
if o has every causal power in some set cᵢ, then o necessarily has every
causal power in c₁∩c₂∩…∩cₙ. So c₁∩c₂∩…∩cₙ meets condition (a).

It is not incontestable that c₁∩c₂∩…∩cₙ meets condition (b), because
the argument that it does so presupposes a particular account of the
realization relation. This account of realization, however, is strongly
suggested by the causal power model of properties. According to the
working definition of realization, a property P realizes a property Q if
and only if (i) if an object (or event) o instantiates P, then, necessarily,
o instantiates Q, and (ii) o's being P explains, in a metaphysical
sense, o's being Q; being P is one of the ways in which a thing can be
Q. On the causal power model of properties, an object instantiates a
property if and only if it possesses every causal power in the set that
constitutes that property. So, putting these ideas together leads
naturally to the following general definition of realization:


P realizes Q if and only if (def.), where p and q are the sets of powers
constituting P and Q, q ⊂ p.


Consider the relation that obtains between the determinate
property being yellow and the determinable of that determinate, being
colored. Because property P's being a determinate of a determinable
property Q is sufficient for P's realizing Q, being yellow realizes being
colored. On the causal power model of properties, the properties being
yellow and being colored are constituted by sets of causal powers. Let us
call these sets of causal powers y and c, respectively. Now consider all
the things that instantiate being yellow. These things are yellow in
virtue of having all the causal powers in set y. And consider all the
things that instantiate being colored. These things are colored in virtue
of possessing all the causal powers in set c. What relation obtains
between c and y? Clearly, c is a proper subset of y: every causal power
something possesses in virtue of instantiating being colored is also a
power it possesses in virtue of instantiating being yellow. But not every
power a thing possesses in virtue of instantiating being yellow is a power
it possesses in virtue of instantiating being colored. (For example,
yellow warning signs are visible to drivers at night, but blue, green,
and purple signs are not.) And consider the relation that obtains
between the set of causal powers constituting the mental property M
and one of M's realizors, P₁. Let us identify these properties with sets
of causal powers m and p₁. Again, consider all the things that
instantiate M. These things are M in virtue of possessing all the causal
powers in m. And consider all the things that instantiate some
realizing property P₁. These things instantiate P₁ in virtue of possessing
all the causal powers in p₁. What relation obtains between m and p₁?
Again, it is clear that m is a proper subset of p₁: every power
something possesses in virtue of instantiating M is also a power something
would possess in virtue of instantiating P₁. But not every power a thing
possesses in virtue of instantiating P₁ is a power it would possess in
virtue of instantiating M. (For example, suppose M is believing that
snow is white. And suppose, as functionalism would have us believe,
that this mental property can be instantiated by a machine made of
steel, and that only such steel machines instantiate P₁. Such a
machine would possess the power of being magnetic in virtue of
instantiating P₁, but it would not possess the power of being magnetic in
virtue of instantiating believing that snow is white.)
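
On this definition, realization reduces to a proper-subset test on sets of causal powers. The sketch below is illustrative only: the helper name `realizes` and every power name are hypothetical, loosely modeled on the yellow-sign and steel-machine examples.

```python
def realizes(p: frozenset, q: frozenset) -> bool:
    """P realizes Q iff q (the powers constituting Q) is a proper subset of p."""
    return q < p


# Hypothetical powers constituting 'being colored' (c) and 'being yellow' (y).
c = frozenset({"visible_in_daylight"})
y = c | {"visible_to_drivers_at_night"}

# Hypothetical powers constituting a mental property M (m) and a realizor P1 (p1).
m = frozenset({"disposes_assent_to_snow_is_white"})
p1 = m | {"is_magnetic"}

yellow_realizes_colored = realizes(y, c)   # c is a proper subset of y
colored_realizes_yellow = realizes(c, y)   # the converse fails
p1_realizes_m = realizes(p1, m)
m_realizes_itself = realizes(m, m)         # no set is a proper subset of itself
```

Because the test uses proper inclusion, a realized property is never identical to its realizor, and the extra powers in the realizor (here the hypothetical `is_magnetic`) are not powers of the realized property.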

If one grants the above account of realization, then c₁∩c₂∩…∩cₙ
meets condition (b); that is, if an object o possesses every causal
power in c₁∩c₂∩…∩cₙ, then o satisfies (χ₁x ∨ χ₂x ∨ … ∨ χₙx). According
to the above account of realization, a multiply realized property is
constituted by the intersection of the sets constituting its realizors.²⁵


25 Let C be the set of causal powers constituting being colored, and let c₁ and c₂ and
… cₙ be sets of causal powers constituting specific color properties. Suppose C has as
a member some causal power cp that is not a member of c₁∩c₂∩…∩cₙ. Then there
would have to be a cᵢ that did not have cp as a member. But then something could
possess all the causal powers in this cᵢ yet not possess all the powers in C. So, a thing
could instantiate the color property constituted by this cᵢ yet not instantiate being
colored. But this is absurd, as every thing that is some specific color or other is
colored. So C cannot contain some causal power that is not a member of
c₁∩c₂∩…∩cₙ. Conversely, suppose c₁∩c₂∩…∩cₙ has as a member some causal power
that is not a member of C. Then a thing could have every causal power constituting
being colored, yet lack a causal power that is possessed by every object that instantiates
some specific color property or other. So, an object could instantiate being colored,
Thus c₁∩c₂∩…∩cₙ constitutes being colored. Therefore, any object o that
possesses all of the causal powers in c₁∩c₂∩…∩cₙ instantiates being
colored. But if o instantiates being colored, then o must instantiate some
specific color property Cᵢ. That is, if o has all of the causal powers in
c₁∩c₂∩…∩cₙ, then o must either have all the causal powers in c₁, or
have all the causal powers in c₂, or …. But if o possesses all the causal
powers in some set cᵢ, then o must satisfy the corresponding disjunct
χᵢx, and hence o must satisfy (χ₁x ∨ χ₂x ∨ … ∨ χₙx). So, assuming the
account of realization that is strongly suggested by the causal power
model of properties, c₁∩c₂∩…∩cₙ also meets condition (b).
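
The role of the intersection can be illustrated with another toy model. This is a sketch under stated assumptions, not a proof: the color sets, power names, and objects are all hypothetical, and the universe is built so that nothing possesses the shared power without also possessing all the powers of some specific color, which is exactly the assumption that the account of realization is meant to secure.

```python
# Hypothetical sets of causal powers constituting specific color properties.
color_sets = {
    "yellow": frozenset({"looks_colored", "looks_yellow"}),
    "blue": frozenset({"looks_colored", "looks_blue"}),
    "red": frozenset({"looks_colored", "looks_red"}),
}

# d_prime: the intersection of the sets constituting the realizors.
d_prime = frozenset.intersection(*color_sets.values())

# Toy universe: object name -> causal powers it possesses (illustrative only).
universe = {
    "lemon": color_sets["yellow"] | {"has_mass"},
    "bluebird": color_sets["blue"] | {"has_mass"},
    "cherry": color_sets["red"] | {"has_mass"},
    "window_pane": frozenset({"has_mass"}),
}


def satisfies_disjunction(name: str) -> bool:
    """True if the object has every power in at least one specific color set."""
    return any(c_i <= universe[name] for c_i in color_sets.values())


# Condition (a): every satisfier of the disjunction has every power in d_prime.
condition_a = all(
    d_prime <= powers for name, powers in universe.items() if satisfies_disjunction(name)
)

# Condition (b): everything with every power in d_prime satisfies the disjunction.
condition_b = all(
    satisfies_disjunction(name) for name, powers in universe.items() if d_prime <= powers
)
```

Condition (a) holds for any universe whatsoever, since the intersection is a subset of each color set. Condition (b) holds here only because of how the toy universe was stipulated; adding an object with `looks_colored` but no specific color power would make it fail.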

The same arguments apply, mutatis mutandis, to the most relevant
case of the reductionist's properly disjunctive predicate,
(π₁x ∨ π₂x ∨ … ∨ πₙx). For each realizing physical property Pᵢ designated
by a disjunct πᵢx, let pᵢ be the set of causal powers that constitutes Pᵢ. For
the reasons given above, the set p₁∩p₂∩…∩pₙ meets both conditions.
Condition (a) is met because every possible thing that satisfies
(π₁x ∨ π₂x ∨ … ∨ πₙx) possesses every causal power in p₁∩p₂∩…∩pₙ, and
condition (b) is met because every possible thing that possesses every causal
power in p₁∩p₂∩…∩pₙ satisfies (π₁x ∨ π₂x ∨ … ∨ πₙx). And
consequently, as is required to avoid rampant legitimacy, it cannot be
concluded from Armstrong's second argument that the reductionist's
(π₁x ∨ π₂x ∨ … ∨ πₙx) fails to designate a legitimate property.

Armstrong's second argument, like his first argument, fails to
demonstrate that no properly disjunctive predicates designate
legitimate properties. But the argument serves to place an additional
constraint on an acceptable account of disjunctive properties. A
predicate π designates a legitimate property P if and only if there is
some nonempty set of causal powers p such that (a) if a particular o
satisfies π, then o possesses every power in p, and the converse, (b) if
a particular o possesses every power in p, then o satisfies π. What must
be true of a properly disjunctive predicate if it is to meet conditions (a)
and (b)? The constraint derived from Armstrong's first argument is
that the disjuncts of a properly disjunctive predicate must overlap on
a property. Let us now redefine the notion of disjuncts overlapping in
terms of the causal power model of properties: the disjuncts of a
disjunctive predicate overlap on a set of powers if and only if every
object (or event) that satisfies any of the disjuncts must possess every




yet not instantiate some specific color property. But this also is absurd: every thing
that is colored is some specific color. Therefore, the set of causal powers constituting
being colored must be the intersection of all the sets of causal powers constituting
specific color properties. And, generalizing now, a multiply realized property is
constituted by the intersection of the sets constituting its realizors.
causal power in that set. It can now be discerned that it is necessary
and sufficient for a properly disjunctive predicate's meeting
condition (a) that its disjuncts overlap on a set of causal powers. What
Armstrong's second argument illustrates is that condition (b) must
also be satisfied. That is, it is not enough for (π₁x ∨ π₂x ∨ … ∨ πₙx) to
designate a legitimate property that its disjuncts overlap on property
M; it also must be the case that any object that instantiates M must
satisfy (π₁x ∨ π₂x ∨ … ∨ πₙx). Let us say that a properly disjunctive
predicate that has both of these features is such that its disjuncts
satisfactorily overlap.

I conclude that Armstrong's arguments demonstrate that properly
disjunctive predicates whose disjuncts do not satisfactorily overlap fail
to designate legitimate properties. Assuming the causal power model
of properties and the attendant account of realization, however, a
properly disjunctive predicate whose disjuncts satisfactorily overlap
does designate a legitimate property. These results entail the
following general account of disjunctive properties: a properly disjunctive
predicate designates a legitimate property if and only if its disjuncts
satisfactorily overlap. Moreover, in the most relevant case of the
reductionist's exhaustive properly disjunctive (π₁x ∨ π₂x ∨ … ∨ πₙx), where
each disjunct πᵢx designates one of M's realizors, the disjuncts
satisfactorily overlap on the intersection of the sets of causal powers
constituting M's realizors, namely, p₁∩p₂∩…∩pₙ. These results are
desirable, as they provide a means for avoiding rampant legitimacy.
More specifically, because the disjuncts of the reductionist's
exhaustive properly disjunctive predicates satisfactorily overlap, premise (5)
in the argument presented in section II can be rejected.


IV. THE CONSEQUENCES FOR NRP
The causal power model of properties and the attendant accounts of
disjunctive properties and the realization relation have significant
consequences for NRP. Recall that NRP is the conjunction of two
theses:


Physicalism: all particulars are constituted by physical particulars, and all
properties are realized by physical properties.

Nonreducibility: mental predicates cannot be reduced by physical
predicates.


The above definition of realization serves to clarify and support the
thesis of physicalism. First, the definition immediately secures the
result, central to NRP, that mental properties are not identical to the
physical properties that realize them, for no set is identical to one of
its proper subsets. Second, the definition helps to clarify that NRP is
incompatible with the metaphorical claim, often associated with NRP,
that mental properties exist “over and above” their realizors.
According to the above well-motivated definition, multiply realized mental
properties, though real and causally efficacious, are better thought of
as parts of their physical realizors. The definition thus undermines
the standard “levels” picture, according to which mental properties
exist “at a higher level” than their physical realizors, and
consequently it falsifies Kim's equation of NRP with some form of
emergentism.²⁶ The falsification of this equation is an extremely beneficial
result for NRP, for without this equation the problems forcefully
presented by Kim concerning causal and explanatory exclusion of
mental properties by physical properties do not arise.²⁷ Just as there
is no causal and/or explanatory competition between a whole and its
parts, so there is no causal and/or explanatory competition between
instances of mental properties and instances of their physical
realizors. And, finally, the causal power model of properties combined
with the above definition of the realization relation surpasses the old
notions of supervenience in providing an explanation of the relation
between mental properties and physical properties. Kim²⁸ has
recently complained that “mind-body supervenience is not an
explanatory theory; it merely states a pattern of property covariation
between the mental and the physical, and points to the existence of
a dependency relation between the two. Yet it is wholly silent on the
nature of the dependency relation that might explain why the mental
supervenes on the physical” (ibid., p. 190). The above definition of
realization provides at least the foundation for an explanation of this
dependency relation: mental properties depend upon physical
properties because mental properties in part constitute certain physical
properties. Of course, all of these remarks are highly programmatic,
and carrying out the relevant programs would require a more
detailed working out of the causal power model of properties than I
have attempted here. But these remarks do illustrate that advocates
of NRP have much to gain from adopting the causal power model of
properties and the definition of realization that is implied by that
model.

Although the causal power model of properties and the attendant 
account of realization provide some support for and clarification of 


26 See Kim, “The Nonreductivist's Troubles with Mental Causation,” in John Heil
and Alfred Mele, eds., Mental Causation (New York: Oxford, 1993), pp. 189-210.

27 See Kim (op. cit.), and “The Myth of Nonreductive Materialism,” Proceedings and
Addresses of the American Philosophical Association, LXIII (1989): 31-47.

28 “The Mind-Body Problem: Taking Stock After Forty Years,” Philosophical
Perspectives, XI (1997): 185-205.
the thesis of physicalism, they seem to spell trouble for the thesis of
nonreducibility. The most prevalent argument in support of
nonreducibility is the argument from multiple realizability, which argues
that mentalistic predicates such as μ cannot be reduced by any
physicalistic predicate because mental properties are multiply
realized by physical properties. Against this argument the reductionist
can employ the “disjunctive strategy”: he can argue that, despite the
phenomenon of multiple realization, mentalistic predicates such as μ
are reduced to physicalistic predicates by way of disjunctive bridge
principles of the form


(∨BP) ∀x [μx ↔ (π₁x ∨ π₂x ∨ … ∨ πₙx)]


The standard nonreductionist reply to the reductionist's “disjunctive
strategy” is to “question the propriety of [(π₁x ∨ π₂x ∨ … ∨ πₙx)] as a
legitimate property.”²⁹ But the considerations of the previous
sections undermine this standard nonreductionist reply, for they
provide the reductionist with a well-motivated and plausible account of
“disjunctive properties,” and on this account the reductionist's
exhaustive disjunctive predicates would designate legitimate properties.
Thus, unless another reply to the disjunctive strategy can be
formulated, rampant illegitimacy is avoided only at the cost of the argument
from multiple realizability in support of nonreducibility.

Fortunately, another reply to the disjunctive strategy can be
formulated, and the formulation of this reply serves to clarify NRP further.
Consider again the bridge principle schema (∨BP), and in particular
the disjunctive predicate schema (π₁x ∨ π₂x ∨ … ∨ πₙx). What exactly is
this disjunctive predicate schema schematic of? As is often pointed
out, it is likely that mental property M, whatever it is, will have an
infinite realization base. This will mean that to predicate reduce μ
the reductionist must invoke an exhaustive disjunctive predicate that
has infinitely many disjuncts. But nobody ever has, or ever will,
formulate an infinite predicate. Moreover, nobody has ever even
formulated a single predicate πx such that for some mental property
M, the property designated by πx realizes M. (One reason for this is
that the physical realizors of many mental properties are wide: they
include complex environmental features external to the creature
instantiating the mental state.) In short, the reductionist's appeal to
exhaustive disjunctive predicates is an idealization of a rather
extreme sort. We are familiar with many mentalistic predicates that, it
is assumed here, designate mental properties. But we have no idea


29 Kim, The Philosophy of Mind (New York: Westview, 1996), p. 219.
what the relevant disjunctive predicates might be, and we have good
reason to believe that we cannot really formulate such disjunctive
predicates. The reductionist's appeal to disjunctive predicates must
then be understood as a claim concerning what is in principle possible;
it is a claim to the effect that some sort of epistemologically ideal
being could reduce mentalistic predicates by exhaustive infinite
disjunctions of physicalistic predicates. The reductionist is asserting the
counterfactual claim that if we could formulate infinite physicalistic
predicates of the form (π₁x ∨ π₂x ∨ … ∨ πₙx), then we could reduce
mentalistic predicates. Moreover, the results of the previous sections
suggest that the reductionist's counterfactual claim is, despite the
arguments against disjunctive properties, true. But is the truth of the
reductionist's counterfactual claim incompatible with the thesis of
nonreducibility?

It all depends on how the thesis of nonreducibility is interpreted.
If the ‘cannot’ in this thesis is interpreted to be very strong, so that
the thesis states that it is not even in principle possible to reduce
mentalistic predicates, then the reductionist carries the day. For his
counterfactual claim is true: if we could formulate exhaustive infinite
physicalistic predicates of the form (π₁x ∨ π₂x ∨ … ∨ πₙx), then we
could reduce mentalistic predicates. But if the ‘cannot’ is interpreted
to be less strong, so that the reductionist's counterfactual claim is
compatible with nonreducibility, then the defender of NRP carries
the day. And this is the appropriate response for the defender of NRP
to make against the disjunctive strategy: the defender of NRP should
grant that it is in principle possible for mentalistic predicates to be
reduced by exhaustive disjunctive predicates, but he should deny that
this is incompatible with nonreducibility. This weaker version of NRP
does not deny that it is in principle possible for “ideal” scientists to
formulate physicalistic predicates that would reduce our mentalistic
predicates. (It is difficult to see how such a denial could be
compatible with physicalism.) It rather claims that we really shall not and
cannot reduce our mentalistic predicates to physicalistic predicates.
One might object that by weakening the thesis of nonreducibility in
this way, NRP is rendered a purely epistemological and thus
uninteresting doctrine. For on this weaker view, the nonreducibility of
mentalistic predicates is not explained by mental properties and
physical properties being in different ontological categories; there is
no reason to assume that mental properties are in any ontologically
significant sense on a “higher level,” nor that they in any ontologically
significant sense “emerge” from physical properties. Rather, the
weaker view suggests that the nonreducibility of mentalistic
predicates is purely due to our own epistemological limitations. But, first,
I do not think this is an objection so much as it is a clarification of
what a plausible version of NRP must claim; NRP, properly
understood, is primarily an epistemological doctrine. It alleges that the
theories and predicates of the special sciences never will in fact be
reduced to physical theory. And, second, even this weaker version of
NRP ought to satisfy psychologists, economists, and
nonreduction-minded philosophers: on this weaker conception of NRP,
generalizations expressed in terms of mentalistic predicates can be supported
by evidence, and can support counterfactuals. Thus there is no
special reason to doubt the existence of psychological, economic, and
such laws. Consequently, even on this weaker conception, psychology
and the other special sciences are guaranteed autonomy from
physics, though, like physics, they are by no means guaranteed to be
successful.

LENNY CLAPP 
THE END OF MILLIANISM: MULTIPLE BEARERS,
IMPROPER NAMES, AND COMPOSITIONAL MEANING*


Millianism, in J. S. Mill's¹ words, claims that


Proper names are not connotative. They denote the individuals who are
called by them, but they do not indicate or imply any attributes as
belonging to those individuals (ibid., I, 2, 5, 33).


Mill argued that the proper name ‘Dartmouth’ cannot be taken to
have the connotation (sense) city at the mouth of the river Dart because


If sand should choke up the mouth of the river [Dart], or an earthquake
change its course, and remove it to a distance from the town, the name
of the town would not necessarily be changed. That fact, therefore, can
form no part of the signification of the word, for otherwise, when the
fact confessedly ceased to be true, no one would any longer think of
applying the name (ibid., I, 2, 5, 34).


Even though this argument is clear, cogent, and appeared in a widely
read book by a highly influential philosopher, it took over a century
for the argument to have a significant impact on the philosophy of
language.

The reason for the delay in such impact is that the philosophers
whose work shaped the discussion of names simply took no notice of
Mill's argument. Gottlob Frege,² who paid close attention to
everything Mill said about number in Books II and III of A System of Logic,
ignored everything he said about names in Book I. Bertrand Russell³
developed his theory that proper nouns like ‘Socrates’ are
descriptions without ever mentioning Mill's argument against such theories.
Ludwig Wittgenstein,⁴ who might well have adopted Mill's view,
adopts an apparently incompatible one, roughly, that a name is


* I wish to thank Bradley Armour-Garb, Kent Bach, Edward Becker, Charles
Chihara, Arthur Collins, Ned Block, David Pitt, and Virginia Valian. Thanks also to
my audiences at Centre de Recherche en Epistémologie Appliquée, the December
1999 New York Conference on Methods, the Simon Fraser University Philosophy
Department, and the University of California/Berkeley Philosophy Department.

1 A System of Logic, Volume I (London: Parker, Son, and Bourn, 1862).

2 The Foundations of Arithmetic, J. L. Austin, trans. (Oxford: Blackwell, 1953), pp.
29-32.

3 “The Philosophy of Logical Atomism,” The Monist, XXVIII (1918): 523-25.

4 Philosophical Investigations (Oxford: Blackwell, 1953), §79.
associated with a family of descriptions from which a speaker draws as
the occasion demands. Furthermore, we look in vain in Meaning and
Necessity: A Study in Semantics and Modal Logic, Rudolf Carnap's⁵
celebrated reconstruction of Fregean semantics, for what he has to say
about Mill's argument. Finally, taking his cue from Wittgenstein's
discussion of names, John Searle⁶ proposed his “sufficient but so far
unspecified number of [descriptions]” view, but, again, there is no
mention of Mill's argument. It is not until Saul Kripke's⁷ imaginative
elaboration of Millian modal reasoning in Naming and Necessity that
the argument comes into its own. Over the next decades, Kripke's
barrage of Mill-style counterexamples succeeded in persuading most
philosophers of language of the truth of Millianism.


I. THE PURE METALINGUISTIC THEORY

Over the years, I have been arguing that Millian modal reasoning
does not establish the truth of Millianism.⁸ Such reasoning shows that
classical, Fregean, descriptivism is false, but it does not show that
Millianism is true because it does not show that descriptivism is false.
The reasoning shows that proper names do not have a Fregean sense,
but not that they do not have a sense. The hitch is that the Fregean
notion of sense is not the only notion of sense, and, hence, classical
descriptivism is not the only form of descriptivism. In addition to
classical descriptivism, there is the option of a revisionist,
non-Fregean, form of descriptivism.

This option has been neglected every bit as much as Mill's
argument, though, to be sure, for a totally different reason. Non-Fregean
descriptivism was neglected not because philosophers chose to ignore
it, but because, equating sense with Fregean sense, and, hence,
intensionalism with Fregean intensionalism, they simply had no
inkling of its existence. Thus, the first step in correcting the situation is
to provide a non-Fregean definition of sense. This is (D):


5 Chicago: University Press, 1956, Enlarged Edition, Section 79.

6 “Proper Names,” Mind, LXVII (1958): 166-73.

7 “Naming and Necessity,” in Donald Davidson and Gilbert Harman, eds.,
Semantics of Natural Language (Boston: Reidel, 1972), pp. 253-355; Naming and Necessity
(Cambridge: Harvard, 1980). References in this paper are to the latter publication.

8 “A Proper Theory of Names,” Philosophical Studies, XXXI, 1 (1977): 1-80; “The
Neoclassical Theory of Reference,” in Peter A. French, Thomas E. Uehling, Jr., and
Howard K. Wettstein, eds., Contemporary Perspectives in the Philosophy of Language
(Minneapolis: Minnesota UP, 1979), pp. 103-24; “Has the Description Theory of
Names Been Refuted?” in George Boolos, ed., Meaning and Method (New York:
Cambridge, 1990), pp. 31-61; “Names without Bearers,” The Philosophical Review, CIII
(1994): 1-39; Sense, Reference, and Philosophy (forthcoming).
(D) The sense of an expression is that aspect of its structure which is
responsible for its sense properties and relations, that is, having a
sense (meaningfulness), sameness of sense (synonymy), multiplicity
of sense (ambiguity), repetition of sense (redundancy), opposition
of sense (antonymy), and so on.


What makes (D) a non-Fregean definition of sense is that (D) makes
sense the determiner of sense properties and relations rather than the
determiner of referential properties and relations.

The pay-off for switching to (D) is that senses no longer have to
contain the substantive properties that trigger Millian
counterexamples. On the classical description theory, senses, being reference
determiners, must contain a cluster of substantive properties which
suffices to determine reference. Otherwise, they do not qualify as
bona fide senses. With (D), it becomes possible to construct a version
of the description theory on which the sense of a name contains no
substantive properties. None is needed because we no longer have to
ensure that the sense of a name determines its referential properties
and relations. Thus, (D) enables us to formulate the one description
theory which avoids Millian counterexamples.

This theory, the pure metalinguistic theory (PMT for short), says that
the sense of a name contains only the nominal property of being the
name of its bearer(s). On PMT, the sense of a proper name ‘N’ is an
instance of (1):


(1) the thing which is a bearer of ‘N’


where “x is a bearer of ‘N’” is understood as expressing the relation
between an individual and a name it bears. PMT can take this view
because (D) requires that decisions about whether a term has a sense,
and about what sense it has, are to be based on an inference to the
best explanation of the term’s sense properties and relations. For
example, we take the sense the thing which is a bearer of ‘John Stuart Mill’
as the sense of the name ‘John Stuart Mill’ not because doing so
enables us to determine its referent but because doing so enables us
to explain inter alia why (2) analytically entails (3):


(2) John Stuart Mill wrote A System of Logic 
(3) A bearer of ‘John Stuart Mill’ wrote A System of Logic 


Similarly, taking ‘Socrates’ to mean the thing which is a bearer of
‘Socrates’ enables us to explain why (4) is analytic, or, as John Locke
put it, trifling:


(4) Socrates is a bearer of ‘Socrates’ 
There is, of course, the question of whether purity (the absence of
substantive properties in the senses of names) is enough to guarantee
the absence of Millian counterexamples. It is enough if the
nominal properties appearing in instances of (1) do not themselves
generate such counterexamples. Shortly, I shall explain why they do
not, and, subsequently, describe the bearing of these considerations
on a challenging thesis of Kripke’s about epistemology and modality.

Even the most sophisticated Fregean theories of names (for example,
Searle’s) face Millian modal problems, and, by the same token,
even the most sophisticated Millian theories of names (for example,
Kripke’s) face Fregean semantic problems. PMT faces none of those
problems because it has none of the features responsible for them.

Millian modal problems arise for Fregean theories because Fregean
senses of names contain substantive properties. The need for
Fregean theories to say that names have a sense, which arises from the
Fregean puzzles that motivate the introduction of senses, translates
automatically, on such theories, into the need to say that names have
a Fregean sense. So, the sense of a name is taken to have the
substantive properties required for reference determination, and, as
a consequence, there are modal counterexamples to the description
alleged to express the sense of the name. Given (D), we can have
senses that are instances of (1), and, hence, we can have Fregean
solutions to Fregean puzzles without Fregean senses.

To be sure, Kripke thinks modal counterexamples arise for metalinguistic
description theories, too, but, unlike PMT, the metalinguistic
description theories for which they arise are impure in that they
supplement the metalinguistic relation in the senses of names with
substantive properties. William Kneale⁹ claimed that ‘Socrates’
means the individual called ‘Socrates’, since “it is obviously trifling to tell
[someone] that Socrates was called Socrates” (ibid., pp. 629-30).
Kripke first objects to Kneale’s theory because “it is dubious that the
Greeks did call him ‘Socrates’.”¹⁰ This criticism does not apply to our
explanation of the analyticity of (4) because “x is a bearer of ‘N’” and
“x is called ‘N’” are different relations. The former but not the latter
is the converse of “‘N’ is a name of x.” The latter means something
like ‘is addressed as’, and, accordingly, whereas the y-place of the
bearer relation takes only proper nouns, the y-place of the relation “x
is called y” takes common nouns as well, for example, “Smith was


9 “Modality, De Dicto and De Re,” in Ernest Nagel, Patrick Suppes, and Alfred
Tarski, eds., Logic, Methodology, and Philosophy of Science (Stanford: University Press,
1962), pp. 622-33.

10 Naming and Necessity, p. 69.


THE END OF MILLIANISM 141 


called a thief and a dastardly murderer.” Thus, while Kripke is right
to say that sentences like ‘Socrates was called ‘Socrates’’ are not
trifling or analytic, and might be false, (4) is analytic and hence
cannot be false (ibid.). Kripke’s counterexamples to Kneale’s impure
metalinguistic description theory do not carry over to PMT.¹¹

Kripke’s “main” objection to Kneale’s metalinguistic description
theories is that “[such a theory] would give a clear violation of the
noncircularity condition.” As Kripke puts it:


According to Kneale, the description [for the name ‘Socrates’] is ‘the
man called ‘Socrates’’. And here, (presumably, since this is supposed to
be so trifling!) it tells us nothing at all. Taking it in this way it seems to be
no theory of reference at all. We ask, “To whom does he refer by
‘Socrates’?” And then the answer is given, “Well, he refers to the man to
whom he refers.”¹²


Kripke is right that a theory on which the senses of names are
genuinely trifling is “no theory of reference at all.” But what of it?
Not all description theories are intended to be theories of reference.
Since PMT is intended to be a theory solely of the senses of names,
the sense that PMT provides for a name like ‘Socrates’ is required to
determine only the sense properties and relations of the name.

Fregean semantic problems arise for Millian theories because those
theories deny that proper names have senses at all. The need to avoid
Mill-style counterexamples, which arises when the senses of names
contain substantive properties, translates, on the Millian’s equation
of the senses with Fregean senses, into the need to avoid senses
altogether. But, given (D), the intensionalist claim that names have a
sense and the Fregean claim that names have a reference-determining
sense come apart. Hence, the Millian can be right to deny that
names have a reference-determining sense, but wrong to deny that
they have a sense. Thus, (D) provides us with the option of saying that


11 A further point in connection with Kripke’s parody is that we can also show, on
the basis of (D), that ‘horse’ does not mean thing named ‘horse’ or thing bearing the
name ‘horse’. Aside from the issue Kripke (Naming and Necessity, p. 69) raises, the
point is important for our classification of nouns at the beginning of section III.

If common nouns could be so represented, then, on the construal of ‘Boys are
bearers of ‘Boy’’ where the subject is the (plural) common noun ‘boys’, the
sentence would be analytic in the manner of (4).

But, on this construal, the utterances of the sentence are false. Manny, Moe, and
Jack are named ‘Manny’, ‘Moe’, and ‘Jack’, not ‘Boy’. Actually, the sentence is
ambiguous. But it is on the other construal (the one on which its subject is the
proper noun ‘Boy’ and the sentence is thereby parallel to ‘Smiths are bearers of
‘Smith’’) that ‘Boys are bearers of ‘Boy’’ is analytic. For example, Tarzan’s son is
named ‘Boy’; see the 1939 MGM film Tarzan Finds a Son.

12 Naming and Necessity, p. 70.
names have a sense but that their sense determines no more than
their sense properties and relations.

Such thin senses are all that is necessary to solve the puzzles that
Millian theories fail to solve. Once Millianism is adopted, Frege’s¹³
puzzle about identity sentences with names resurfaces because all we
are left with to distinguish sentences like (5a) and (5b) are referential
and orthographic differences between the proper names in them.


(5a) Hesperus is Hesperus
(5b) Hesperus is Phosphorus


Neither reference nor orthography, as Frege himself pointed out, can
account for the fact that sentences like (5a) “hold a priori and,
according to Kant, are analytic,” while sentences like (5b) “contain
very valuable extension[s] of our knowledge” (ibid., p. 56). But if we
have senses for names, even ones as thin as on PMT, we can account
for the difference in meaning between sentences like (5a) and (5b).
Taking ‘Hesperus’ and ‘Phosphorus’ to have senses that are instances
of (1) is taking these names to have different senses, which is all that
is required to explain why (5a) is analytic and (5b) is not.

Similarly, taking ‘Londres’ and ‘London’ to have senses that are
instances of (1) is taking them to have different senses, and this is all that
is required to solve Kripke’s Pierre puzzle.¹⁴ Since Pierre’s beliefs are
now the thing which is a bearer of ‘Londres’ is pretty and the thing which is a
bearer of ‘London’ is not pretty, Pierre’s beliefs and statements concerning
the city he read about and the city in which he is currently living are
different, and, hence, not inconsistent, and there is no puzzle.

Scott Soames¹⁵ says “Clearly, [Pierre] has said both that London is
pretty and that London is not pretty” (ibid., p. 407). But Pierre
never believed or said that London is pretty. One can, of course, say
that Pierre believes or said of London that it is pretty, but this is not
saying that he believes or said that London is pretty. If Oedipus
believed and said that Jocasta is his wife, then Oedipus believed and
said of his mother that she is his wife, but this is not to say that he
believed and said that his mother is his wife. What Pierre believed
and said is that the thing which is a bearer of ‘Londres’ is pretty, something
quite consistent with believing and saying that the thing which is a bearer


13 “On Sense and Reference,” in Translations from the Philosophical Writings of Gottlob
Frege, Max Black and Peter Geach, eds. (Oxford: Blackwell, 1952), pp. 56-57.

14 “A Puzzle about Belief,” in Nathan Salmon and Scott Soames, eds., Propositions
and Attitudes (New York: Oxford, 1988), pp. 102-48.

15 “Direct Reference and Propositional Attitudes,” in Themes from Kaplan (New
York: Oxford, 1989).
of ‘London’ is not pretty. PMT thus explains why faultless reasoning
from acceptable empirical evidence does not land Pierre in contradiction.
Moreover, that they should not land him in contradiction is
surely everyone’s intuition.

Frege’s problem about the meaningfulness of expressions and
sentences containing bearerless names also resurfaces when Millianism
is adopted.¹⁶ The most acute form of this problem, as I argued
elsewhere,¹⁷ is the case of true, hence meaningful, sentences
like (5c):


(5c) Santa Claus does not exist 


Since Millianism claims that bearerless names like ‘Santa Claus’ are
also senseless, such names have nothing to contribute to the compositional
meanings of sentences. Hence, Millians must deny the meaningfulness
of perfectly meaningful existence sentences like (5c).
These sentences are counterexamples to Millianism. In contrast, on
PMT, bearerless names like ‘Santa Claus’ have a sense, and, hence,
have something to contribute to the compositional meanings of
sentences. Accordingly, existence sentences like (5c) have a meaning.

Further, we should indicate briefly how PMT handles Kripke’s
Paderewski case.¹⁸ Someone who realizes that the politician and the
musician are one and the same person might express the realization
by saying “Paderewski the politician is Paderewski the musician.” In
this case, the dummy term ‘thing’ in the sense of the first occurrence
of ‘Paderewski’ would get filled with the sense of the apposition ‘the
politician’, and the dummy term ‘thing’ in the sense of the second
occurrence of ‘Paderewski’ would get filled with the sense of the
apposition ‘the musician’. Thus, the sentence would have the sense
the politician who is a bearer of ‘Paderewski’ is the musician who is a bearer
of ‘Paderewski’. Since filling of the dummy terms takes place in the
process of forming compositional senses for ‘Paderewski the politician’
and ‘Paderewski the musician’, the fact that the senses of these
noun phrases contain substantive properties has no bearing on the
status of PMT’s claim that a proper noun like ‘Paderewski’ contains
no substantive properties. (1) asserts nothing about the senses of
constituents in which proper names occur as proper parts.

In a case where the speaker says simply, “Paderewski is Paderewski,” it
must be clear in the context that the statement is the denial of a prior
presumption that the Paderewskis are different people. PMT provides


16 “On Sense and Reference,” p. 58.
17 “Names without Bearers,” pp. 1-4, and p. 34, footnote 45.
18 “A Puzzle about Belief,” pp. 102-48.
the sentence meaning required for such a prior presumption. The
indefinite quantifier in the sense of ‘Paderewski’ provides the possibility
of disjoint reference; the definite quantifier and the term provide the
possibility of linking the occurrences of ‘Paderewski’ with background
information (for example, about occupation) that makes them contextually
definite in ways suggesting disjoint reference to someone with
certain beliefs (for example, about occupation).

Eliminating substantive properties from the senses of proper
names eliminates modal counterexamples just in case the nominal
properties in instances of (1) do not give rise to such counterexamples.
David Braun¹⁹ contends that they do; he says that PMT gets
“the wrong modal result” in the case of an analytic sentence like (4)
because its analyticity commits PMT to saying that Socrates bears
‘Socrates’ in all worlds where he exists, but, in fact, the name something
bears is only one of its contingent properties (ibid., pp. 559-60).

Braun’s criticism rests on a mistake about PMT, but the mistake
might not have been made had my earlier discussions of names been
clearer about why it is that taking sentences like (4) to be analytic
does not commit PMT to claiming that something’s bearing the name
it bears is one of its necessary properties. Those discussions contain
the principle which explains why there is no such commitment, but
they do not use it to provide the explanation.²⁰

Braun’s mistake is to equate the intensionalism within which PMT
is formulated with Frege/Carnap intensionalism. To be sure, within
Frege/Carnap intensionalism, analytic sentences express necessary
(or weakly necessary) truths, and, hence, a metalinguistic description
theory formulated within it would get “the wrong modal result.”²¹ But
the intensionalism in which PMT is formulated differs from Frege/Carnap
intensionalism in how it understands analyticity as well as in
how it understands sense. Not only are senses not reference determiners,
analytic sentences are not necessary (or weakly necessary)
truths. In Semantic Theory,²² I defined a sense of a simple sentence
(one having the form of a predicate $P(x_1, \ldots, x_n)$ with terms
$t_1, \ldots, t_n$ occupying its argument places) as being analytic if
there is a term $t_i$, $1 \le i \le n$, that contains
$P(t_1, \ldots, t_{i-1}, t_{i+1}, \ldots, t_n)$. The difference between
containment here and containment in Frege’s definition is


19 “Katz on Names without Bearers,” The Philosophical Review, CIV, 4 (1995):
559-60.

20 Cf. my “Has the Description Theory of Names Been Refuted?” pp. 37-46.

21 Kripke, “Identity and Necessity,” in Milton K. Munitz, ed., Identity and Individuation
(New York: University Press, 1971), pp. 135-64, here p. 137.

22 New York: Harper and Row, 1972, pp. 171-97.
that the former is literal “beams in a house” containment while the
latter is figurative “plants in their seeds” containment.²³ Our analyticity
is mereological structure; Frege’s is logical structure.
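
The containment test just described can be illustrated with a small sketch. It is offered only as an illustration under stated assumptions, not as part of the theory's formal apparatus: senses are modelled as nested Python values, the names `Pred`, `Term`, `HOLE`, `contains`, and `is_analytic` are hypothetical, and `HOLE` marks the argument place that the containing term itself occupies. The sketch checks only whether the predicate, with the remaining terms in place, occurs literally as a part of one of the terms; it makes no attempt to model identity sentences such as (5a).

```python
from dataclasses import dataclass
from typing import Tuple, Union

# Hypothetical marker for the argument place filled by the containing term.
HOLE = "_"


@dataclass(frozen=True)
class Pred:
    """A predicate applied to argument senses, e.g. bearer_of(_, 'Socrates')."""
    name: str
    args: Tuple["Sense", ...]


@dataclass(frozen=True)
class Term:
    """A term sense: a head (e.g. the dummy 'thing') plus its component predicates."""
    head: str
    parts: Tuple[Pred, ...] = ()


Sense = Union[str, Term, Pred]


def contains(term: Sense, pred: Pred) -> bool:
    """Literal part-whole containment: pred occurs among the parts of term."""
    if not isinstance(term, Term):
        return False
    for part in term.parts:
        if part == pred:
            return True
        if any(contains(arg, pred) for arg in part.args):
            return True
    return False


def is_analytic(pred_name: str, terms: Tuple[Sense, ...]) -> bool:
    """True if some term t_i contains the predicate with the other terms in place."""
    for i, term in enumerate(terms):
        remainder = Pred(
            pred_name,
            tuple(HOLE if j == i else other for j, other in enumerate(terms)),
        )
        if contains(term, remainder):
            return True
    return False


# Sense of 'Socrates' as an instance of schema (1).
socrates = Term("thing", (Pred("bearer_of", (HOLE, "'Socrates'")),))

# (4) Socrates is a bearer of 'Socrates'
sentence_4 = is_analytic("bearer_of", (socrates, "'Socrates'"))

# A non-analytic comparison: Socrates is a bearer of 'Plato'
sentence_plato = is_analytic("bearer_of", (socrates, "'Plato'"))
```

On this modelling, `sentence_4` comes out analytic because the predicate, with the name in its second place, is literally a part of the sense of the subject term, whereas `sentence_plato` does not.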

Frege’s analyticity is provability from laws of logic plus definitions
(ibid., pp. 3-4). An analytic statement, to use W. V. Quine’s²⁴ formulation
of Fregean analyticity, is a logical truth in which we have put
“synonyms for synonyms” (ibid., pp. 22-23). Since Fregean analyticity
is thus a species of logical truth, the referential interpretation of
Fregean analyticity is fixed by the referential interpretation of logical
truth. Not so with our analyticity. Since the mereological structure of
analytic sentences does not make them instances of logical truths, it
is open to us to provide a different principle of referential interpretation
for analytic sentences. In earlier publications,²⁵ I proposed the
principle that analytic sentences are “secured against falsehood.”
That is to say, if a literal use of an analytic sentence makes a statement,
then it is a true one. For, if the containing term refers, then,
in virtue of the containment, it refers to a state of affairs in which the
statement is true, and if the term does not refer, the analytic sentence
makes no statement.

Security against falsehood does not imply necessity or weak necessity.
If I use a token of (4), the token must, the world being as it is,
make a true statement, but the statement that it makes is contingent.
The statement is about someone who might not have been a bearer
of ‘Socrates’. Since PMT is not committed to claiming that the
statements we make with (4) are true in all worlds where Socrates
exists, the existence of Socrates in a world where he does not bear the
name ‘Socrates’ is no counterexample to PMT. PMT does not get
“the wrong modal result.”²⁶
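
The difference between security against falsehood and necessity can be pictured with a short sketch. It rests on simplifying assumptions that are not in the text: a world is modelled as a mapping from names to the individuals bearing them, contextual definiteness is modelled crudely as there being exactly one bearer, and the names `World`, `use_of_sentence_4`, and `true_at` are hypothetical.

```python
from typing import Dict, Optional, Set

# Assumption: a world maps each name to the set of individuals bearing it there.
World = Dict[str, Set[str]]


def use_of_sentence_4(name: str, actual: World) -> Optional[str]:
    """Model a literal use of "N is a bearer of 'N'" in the world of utterance.

    Returns the individual the statement is about, or None when the
    containing term fails to refer, in which case no statement is made.
    """
    bearers = actual.get(name, set())
    if len(bearers) != 1:  # crude stand-in for contextual definiteness
        return None
    return next(iter(bearers))


def true_at(individual: str, name: str, world: World) -> bool:
    """Is the statement that this individual bears the name true at the world?"""
    return individual in world.get(name, set())


actual = {"Socrates": {"s"}}   # s bears 'Socrates' in the actual world
other = {"Xanthias": {"s"}}    # s exists but bears a different name

referent = use_of_sentence_4("Socrates", actual)
no_statement = use_of_sentence_4("Santa Claus", actual)
```

Whenever `use_of_sentence_4` returns an individual, `true_at` holds of that individual in the world of utterance, which is the security against falsehood. Evaluating the same statement at `other` gives a different verdict, which is the contingency; and for a bearerless name the function returns `None`, modelling the case in which no statement is made.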


23 Foundations of Arithmetic, p. 101.

24 “Two Dogmas of Empiricism,” in From a Logical Point of View (Cambridge:
Harvard, 1953), pp. 20-46.

25 Semantic Theory, p. 179; “A Proper Theory of Names,” pp. 37-38; “The
Neoclassical Theory of Reference,” p. 115.

26 I want to thank David Pitt for discussions which helped clarify my thinking on
some of these issues. For further discussion of Braun’s criticisms, see Bradley
Armour-Garb and Katz, “What’s in a Name” (forthcoming).

Note that we can make the referential interpretation of analytic entailments like
that of (3) by (2) correspond to the referential interpretation of analyticity if we
take the former to be preservation of security against falsehood. Note also that there
is a class of analytic sentences, including sentences like ‘Bachelors are unmarried’
or ‘Cats are animals’, which, in addition to being secured against falsehood, are
weakly necessary truths; see my “Analyticity, Necessity, and the Epistemology of
Semantics,” Philosophy and Phenomenological Research, LVII, 1 (1997): 1-28, and Sense,
Reference, and Philosophy.
The conclusion is that PMT avoids the modal counterexamples
that Fregean descriptivism does not avoid and solves Frege’s puzzles
about sense that neo-Millianism does not solve. Note that this conclusion
makes no claim that PMT solves all the puzzles that come
down to us from Frege’s “On Sense and Reference.” Not all of them
are puzzles about sense, and PMT is not obligated to solve those
which are not about sense. In particular, Frege’s puzzle about the
substitution of coreferential names in opaque contexts, though commonly
lumped together with the puzzles about sense, is not such a
puzzle.²⁷ To be sure, it seems to go with Frege’s puzzles about identity
sentences with names and bearerless names, particularly as he presents
them all as problems solvable on the basis of his notion of sense.
But, seen from the perspective of (D), the puzzle about substitution
does not belong with the puzzles about identity sentences with names
and sentences with bearerless names. The latter are problems about
how to explain sense properties and relations of sentences, respectively,
analyticity and meaningfulness, but the former is not. The
puzzle about substitution is a problem about the conditions for valid
substitutional inference; that is, it is about the truth of statements.
Hence, it belongs to the theory of reference. It is thus outside a
theory of sense based on (D).²⁸

We come now to the issue about which, as I shall argue below, PMT
and neo-Millianism are both wrong. PMT and neo-Millianism say that
(T) is false, but it turns out to be true.


(T) Names can have a sense consisting of substantive properties (as the 
classical description theory claims) 


The truth of (T) is bound at this point to appear surprising because
of the strength of Millian reasoning when deployed with the full
range of Kripke’s counterexamples. But the truth of (T) in fact has
nothing to do with the logical status of Millian counterfactual arguments,
which do refute classical descriptivism. Rather, it has to do
with the semantics of a special class of poorly understood expressions,
ones which I shall refer to as improper names.


27 “On Sense and Reference,” pp. 64-67.

28 The point here is of more than historical interest. It undercuts the assumption
of neo-Millians like Soames (op. cit.) that the issue about the meaning of proper
names to which they believe direct reference is the definitive answer must be dealt
with in the same theory that treats the statements that uses of propositional attitude
sentences make in context. The point shows that considerations from this theory
which have been used to motivate the direct reference story about proper names
do not in fact do so.
So, here are the theses of this paper. (T) is true, and, accordingly,
neo-Millianism and PMT require revision. The revision in the case of
PMT is minor, and the minor revision is the end of it. Essentially the
same minor revision in the case of neo-Millianism seems at first also
to be the end of it, but things turn out otherwise. Furthermore,
examination of the largely neglected question of how names obtain
their unique reference shows that such reference, even on Kripke’s
theory, requires descriptions.²⁹ This brings the issue between
neo-Millianism and PMT down to Millianism. Then I present an argument
to refute Millianism. The bottom line is that the revised PMT is
our only tenable theory of names.


II MULTIPLE BEARERHOOD 

In this section, I want to show that the meaningfulness of proper names
is the only issue between neo-Millianism and PMT. It looks at first as if
the issue of whether descriptions mediate the reference of names also
divides the two theories, especially insofar as neo-Millians advertise their
theory as description free. This, however, is an appearance created, on
the one hand, by a proposal of Kripke’s that neo-Millians must reject,
and, on the other, by not recognizing how much of his causal picture of
naming non-Fregean intensionalists can accept.

II.1. Linguistic evidence. The vast majority of proper names in natural
language have multiple bearers, yet the vast majority of their
tokens have unique reference. Since classical descriptivists treat a
proper noun as if it were semantically a common noun, they try to
reconcile multiple bearerhood with unique reference by saying that
proper names are ambiguous, explaining the unique reference of the
tokens of “ambiguous” proper names in terms of disambiguation.³⁰
But proper names are not ambiguous. They are only referentially
equivocal. They have multiple bearers, not multiple senses. If I use a
token of (6) in speaking to philosophers


(6) Nagel has taught in New York City 


their problem 1s not how to choose between two senses of a word (the 
name ‘Nagel’), as, for example, it would be if I had used a token
of (7) 


(7) Let’s meet at the bank 


but how to choose between the two philosophers Ernest and Thomas 
(the bearers of ‘Nagel’). We disambiguate a common noun by citing


29 Naming and Necessity, p. 96.
30 For example, Russell, p. 525.
a difference between its senses, but we cannot do this with proper
nouns. What sense difference disambiguates the proper noun ‘Nagel’
in the way that the sense difference slope and financial establishment
disambiguates the common noun ‘bank’?

Millians, too, have a problem with multiple bearerhood. According
to Mill, a name is a meaningless mark that distinguishes the object(s)
to which it refers on the basis of its sound or orthography. Recall his
example of the robber in the Arabian Nights who put a chalk mark on
a house for purposes of identification, but


Morgiana chalked all the other houses in a similar manner, and defeated
the scheme: how? simply by obliterating the difference of appearance
between that house and the others. The chalk was there, but it no longer
served the purpose of a distinctive mark.³¹


Mill’s account fails to explain how speakers of a natural language
perform the identifying function in their use of names. Since all the
John Smiths are bearers of the name ‘John Smith’, “the difference of
appearance” on which Mill’s account of the identifying function of
proper names depends is not there. For Mill, speakers are like the
robbers that Morgiana foiled.

Neo-Millians who follow Mill in taking the distinguishing function of
a proper name to be accomplished on the basis of sound or orthography
also have a problem with multiple bearerhood. Since the sound and
orthography of the name ‘John Smith’ is the same from one John Smith
to another, ‘John Smith’ does not serve as a distinguishing mark among
the many John Smiths. In the best of all possible worlds for the
neo-Millian, names would mark their bearers in the same precise way as
numbers assigned to prisoners, but, in the actual world, the sound and
orthography of names in natural languages is not a notation that keeps
track of their bearers like prison numbers.

Hence, there is considerable pressure on neo-Millians to go beyond
Mill and concoct a notation on which the name of one John Smith is
somehow a different mark from that of another. Kripke succumbs to
this pressure, proposing that we


adopt a terminology according to which uses of phonetically the same
sounds to name distinct objects count as distinct names.³²


Kripke says this terminology is “analogous to the practice of calling
homonyms distinct ‘words’” (ibid.). This, however, is patently false.
Homonyms, such as ‘bear’ and ‘bare’, differ in etymology, (typically)


31 A System of Logic, I, 2, 5, 36.
32 Naming and Necessity, p. 8.
orthography, syntax, and meaning. Kripke’s distinct names, such as
‘Smith’ used to name one person and ‘Smith’ used to name another, do
not differ in etymology, orthography, syntax, or meaning (since both
names ‘Smith’, being names, are supposed to have zero meaning).

Kripke’s proposal might be, and sometimes has been, construed as
claiming that occurrences of a name like ‘Smith’ have unpronounced
subscripts that distinguish occurrences of ‘Smith’ in terms of the
different bearers of the name. Since this claim constitutes a substantive
hypothesis about the language, it has to be backed up with
substantive linguistic evidence, particularly as accepting the claim
would sacrifice a long-standing linguistic principle, namely, Noam
Chomsky’s autonomy of syntax.³³ Linguistic methodology requires
grammatical evidence for positing underlying grammatical forms, but
there is no such evidence for claiming that unpronounced subscripts
are part of the grammatical form of proper names.³⁴

Further, as Kripke himself concedes, his terminology “does not agree
with the most common usage.”³⁵ Shortly, we shall examine the conflict
with the linguistic fact, but, before we do, we should look at the two
considerations that Kripke offers to mitigate the concession. First, he
suggests that his terminology is appropriate for the question “How many
names are there in this telephone book?” But to suppose that the
answer to this question is got by counting each occurrence of a name as
a distinct name, rather than by counting each simply as a distinct
occurrence of the same name, begs the question. Second, Kripke says
that his proposal has “a great deal to recommend it for theoretical
purposes” (ibid.). He does not elaborate, but, presumably, the purposes
in question are not providing neo-Millianism with different names for
different bearers, since, then, Kripke would be arguing from his own
theory. The only other thing that suggests itself is the purposes the
terminology might serve in logic, for example, the purposes it serves in
Kripke’s quantified modal logic.³⁶ But the usefulness of a formalism for
an artificial calculus implies nothing, one way or the other, about its
usefulness for a grammar of a natural language.


33 That is, the principle that syntactic rules cannot refer to pragmatic, phonological,
or semantic information; see Essays on Form and Interpretation (Amsterdam:
North-Holland, 1977), pp. 36-59.

34 For a discussion of the kind of evidence required for positing underlying
phonological structure, see M. J. Kenstowicz, Phonology in Generative Grammar
(Cambridge: Blackwell, 1994), pp. 107-14.

35 Naming and Necessity, p. 8.

36 Kripke, “Semantical Considerations in Modal Logic,” Acta Philosophica Fennica,
XVI (1963): 83-94.
Whereas classical descriptivists proliferate senses for referentially
equivocal names, Kripke proliferates names themselves.³⁷ Kripke’s
proliferation, however, is not the ordinary sin against parsimony in
which the offending proposal accounts for the same evidence less
economically. Here the proposal which offends against simplicity is
also at variance with the evidence over a wide range of linguistic
structures.

The evidence clearly shows that the proper noun ‘John Smith’ is a
single name with many bearers. We quite naturally say such things as
(8) and (9):


(8) The fifty John Smiths living in New York are Yankee fans
(9) I know one John Smith, but there must be more in America 


But, on Kripke’s proposal, this is deviant. Most strikingly, his proposal
denies that two John Smiths are speaking the literal truth when they
make a claim like (10) or (11):


(10) We have the same name 
(11) ‘John Smith’ is my name, too


It also affirms that two John Smiths are speaking the literal truth 
when they say (12) 


(12) We have different names 


37 Underlying both proliferations is a conflation of a feature of the structure of
language (the possibility of proper names having multiple bearers) with a feature
of the use of language (the fact that referentially equivocal types have contextually
definite tokens). The conflation becomes evident if we ask how all the distinct
Kripkean names ‘John Smith’ are to be individuated. Homonyms are individuated
in terms of linguistic factors concerning etymology, orthography, and meaning, but
such factors do not apply in the case of Kripkean names. We have to appeal to
extralinguistic considerations, in particular, the different bearers in each case. This
is because specifying those names requires us to collect uses of the proper name,
tokens of ‘John Smith’, into equivalence classes on the basis of something like the
idea behind Russellian propositions. Each equivalence class of utterances representing
a Smith “homonym” contains every token whose referent is a particular
Smith plus the Smith who is their referent.

The referent must be included in the equivalence class, since otherwise there will
be many cases where different names are associated with the same equivalence class.
Note cases like the drill sergeant who uses an utterance of ‘Smiths front and center!’
to refer to several Smiths. Since the sergeant’s utterance will appear in more than
one equivalence class, the equivalence class for each of these Smiths has to include
the relevant Smith. Since, on Kripke’s proposal, the many names ‘John Smith’ are
each different words in spite of the fact that they are linguistically indistinguishable,
having the same etymology, phonology, orthography, syntax, and (ex hypothesi
null) sense, the names must include their referents as a linguistically distinguishing
feature. The inclusion of a ‘Smith’ referent as a part in the constitution of each
distinct ‘Smith’ name makes the language/language-use conflation evident.
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Also, Kripke’s proposal says that two John Smiths are making the same 
claim about their name in saying (12) as the claim that a John Smith and 
a Mary Potter are making about therr names ın saying (12) Further, it 
hard to see how to make sense of the definite article 1n sentences like 
(13) without taking the name to have more than one bearer 


(13) The Albert Einstein lived in Princeton.


Still further, one John Smith can “prove” to another John Smith that
his name is not ‘John Smith’. Premise 1, “My name is ‘John Smith’ ”;
premise 2, “Kripke’s proposal is true”; conclusion, “Your name is not
‘John Smith’ ”.

Moreover, since Kripke’s proposal is a linguistic thesis, accepting it
commits one to saying that the English word ‘namesake’ in a sentence
like (14) is a necessarily vacuous term:


(14) My name is Mary Smith, too, so I’m her namesake.


It is no more possible for someone to have a namesake than to have
a male sister.

Another aspect of the situation concerns name change. If Monica
Lewinsky married Kenneth Starr, she would have the choice of keeping
her surname or changing it to her husband’s, but, on Kripke’s
proposal that “distinctness of the referents will be a sufficient
condition for distinctness of names,” the choice does not exist.38 On his
proposal, even if Monica Lewinsky does take the name ‘Starr’, she has
not changed her name ‘Lewinsky’ to her husband’s name.

This case is run together with the case in which Brenda Starr
marries Kenneth Starr. Even though Kripke’s proposal would give the
right answer for the latter case, it gives it for the wrong reason. The
reason Brenda Starr has no choice about whether to change her
name when she marries Kenneth Starr is not that distinctness of
referent makes it logically impossible to have another’s name, but
that, since her husband’s name is ‘Starr’, too, it makes no sense to talk
about her changing her name. There is no such thing as changing the
name ‘Starr’ to the name ‘Starr’.

Yet another aspect of the problem concerns family names. On
Kripke’s proposal, there are no family names in the customary sense
that all the members of the Smith family have the same name ‘Smith’.
Even more counterintuitive, none of the members of the family can
really have the family name ‘Smith’, since its bearer ought to be a
family, hence a collection, not an individual. How benighted of the


38 Naming and Necessity, p. 8, footnote 9.

Smith family to be concerned about passing down the good name
‘Smith’ or be worried about Raymond, the black sheep of the family,
besmirching the family name!

Furthermore, if the name of the son, George Smith, were, as it is on
Kripke’s proposal, a different name from that of the father after whom
he was named, there would be no point to the ‘Junior’ associated with
the son’s name. On Kripke’s proposal, the adjective ‘Junior’ in a
sentence like (15) would be redundant or unnecessary, but it is not:


(15) I’d like you to meet George Smith, Junior.


Note, further, that even a neo-Millian who rejects Kripke’s proposal
has no way to explain how the adjective ‘Junior’ enters compositionally
into the sense of (15). Using the quantifier structure represented
in (1), PMT can say the adjective enters by way of attributing the
property of being the son of another bearer of ‘N’ to the bearer of
the ‘N’ functioning as the object of the verb. PMT thus explains why
(15) implies that there are or were at least two George Smiths.

Finally, lexicographers say such things as that the proper noun
‘Smith’ is related to the Old Norse ‘Smitr’, the Danish and Swedish
‘Smed’, and the Gothic ‘Smitha’. They take such statements to be
about the etymology of the modern English word ‘Smith’. Surely, they
are right that there is but one word ‘Smith’ in English in just the
(type) sense in which C. S. Peirce held that there is one word ‘word’
in English. Now, each Smith can, with equal justice, proudly say, “It’s
my name that the lexicographers find so interesting.” Moreover, it
makes no sense for two Smiths to argue about whose name is the one
being talked about when lexicographers discuss the relation of
‘Smith’ to the Old Norse ‘Smitr’. But, on Kripke’s proposal, it does.
Further, insofar as on that proposal, there is no single name ‘Smith’
in the type sense, the proposal implies, absurdly, that the
lexicographers are talking about nothing.

Kripke’s way of putting his concession (that the terminology “does
not agree with the most common usage”) makes it look as if nothing
more is involved than deviation from certain superficial features of
linguistic custom. In fact, from the standpoint of the linguistic evidence
just cited, Kripke’s proposal about when to count phonological types
and tokens as distinct proper names represents a throwback to the
constructionalist programs of earlier philosophers like Frege, Carnap,
and Gustav Bergmann to reform natural language. Kripke’s proposal is
a clear case of what Wittgenstein called “purify[ing], sublim[ing] the
signs themselves,” on all fours with Frege’s proposal to reform the
language by assigning referentially vacuous expressions an arbitrary
referent or eliminating them entirely.39 It is not easy to see why there
should be so much more tolerance for Kripke’s subliming the language
in the case of names with two or more bearers than there is for Frege’s
subliming the language in the case of names with zero bearers.

II.2. Philosophical consequences. Names are not like the numbers
assigned to prisoners. Neither for proper nouns nor for their tokens
is there a phonologically or orthographically distinct name for each
individual bearing the name. Nor are names ambiguous like common
nouns. Multiple bearerhood contradicts both Kripke’s proposal
and the classical description theory. PMT, in contrast, is consistent
with multiple bearerhood, but, more than this, it predicts the
possibility that names have multiple bearers because (1) has an indefinite
quantifier associated with the bearer place of the name relation.
Neo-Millians cannot, in principle, predict this possibility as a
consequence of the grammar of proper nouns, since, in not allowing
names to have senses, there is no place for an indefinite quantifier
that introduces the possibility of more than one bearer.

To square their theory of names with the linguistic facts, neo-Millians
must reject Kripke’s proposal. But rejecting the proposal opens a gap in
the neo-Millian account of how a speaker achieves unique reference in
the use of a name, and, to close it, neo-Millians have to invoke
descriptions, thereby sacrificing their claim to have a description-free theory.

Consider Kripke’s theory of name transmission:


When the name is ‘passed from link to link’, the receiver of the name
must, I think, intend when he learns it to use it with the same reference
as the man from whom he heard it.40


In the case of names with multiple bearers, we receive the same name
from different teachers who use it with different reference. Each and
every such transmission is the last link in a distinct chain leading back
to a different baptism. For a popular name like ‘Smith’, we are awash
in causal chains. Each of them involves the same name, ‘Smith’, but
a different Smith. Hence, intentions of speakers to use a name with
the same reference as the people from whom they acquired it do not
suffice to achieve unique reference.

On the one hand, there is a communication problem. Not being
mind readers, the audience cannot know to which bearer of a
referentially equivocal name the speaker is referring unless the speaker
provides an individuating description. On the other hand, there is a


39 Wittgenstein, § 94; Frege, Translations from the Philosophical Writings of Gottlob
Frege, p. 70.
40 Naming and Necessity, p. 96.

reference problem. Without appropriate substantive content as part
of the speaker’s referential intention, he or she cannot use the
referentially equivocal name to refer to the object named in a
particular one of the baptisms to which the various chains involving the
name lead back. Big Louie has not referred to anyone if, believing
that there are a number of Corleones in the Mob but having no
identifying descriptions, he gives the police the name ‘Corleone’.

Since neo-Millian theory does not rely on descriptions to identify
the intended bearer, the theory is subject to essentially the same
criticism Kripke makes of Kneale’s theory. Paraphrasing, we ask, “To
whom does he refer by ‘Smith’?” And the answer is given, “Well, he
refers to the object named in the baptism initiating one of the chains
for the name ‘Smith’.” Here Kripke’s objection to Kneale’s theory
comes home to roost:


If this were all there was to the meaning of a proper name, then no
reference would get off the ground at all (ibid., p. 70).


Since neo-Millian theory is a theory of reference, its claim that the
application of names does not depend on descriptions is a casualty of
multiple bearerhood. Such theories can no longer be advertised as
description free.

But, although the misleading advertising has to stop, the product
does not have to come off the shelves. A new and improved version
of neo-Millianism can be sold as a purely referential account of names
in which descriptions enter as part of the use of names, functioning
to distinguish the intended bearer from other contextually salient
bearers. This is still a version of Millianism, since the neo-Millian is
not forced to make the defeating concession that descriptions enter
as senses of names.

But when a neo-Millian account of names is revised in this way,
descriptions play the same role on its account of naming that they
play on PMT’s account.41 Hence, the alleged issue between neo-


41 The role of descriptions in PMT is established by the definite quantifier in (1)
associated with the head of the construction. It introduces the linguistic
requirement of uniqueness for the reference of names, namely, that literal uses of
(singular) names are limited to one bearer per use. Given the information from the
indefinite quantifier, speakers know that their use of a name must distinguish their
intended bearer from other bearers of the name, or at least those known in the
context to be bearers. When more than one bearer is known in the context and
none of the bearers are salient, the speaker is required to supply descriptions which
identify the bearer to which he or she intends to refer. (Note the cuteness of
someone using ‘Hey, Sam’ to get the attention of both Sams in the room.)

Thus, on PMT, descriptions which ensure definite reference for proper names
appear in the context rather than in the sense of the name. This has two immediate

Millianism and PMT about the role of descriptions disappears. Further,
since nothing prevents PMT from incorporating Kripke’s
conception of baptism and name transmission as an interpretation of
the bearer relation in (1), there need be no issue between
neo-Millianism and PMT with respect to the bestowing and transmitting
of names.42 Hence, there is only one issue: Millianism.

Since criticism of classical descriptivism has come from Millians, a
fundamental objection to classical descriptivism has been missed,
namely, that the descriptions classical descriptivists take to express the
sense of a name are inadequate from a linguistic standpoint. This is
because descriptions like ‘a pupil of Plato and a teacher of Alexander’ or
‘led the Israelites out of Egyptian captivity’ prevent us from accounting
for sense properties and relations of sentences containing proper names
like ‘Aristotle’ and ‘Moses’. For example, sentences like ‘Aristotle was a
pupil of Plato and a teacher of Alexander’ and ‘Moses led the Israelites
out of Egyptian captivity’ are not trifling as they should be if the
description their predicate expressed were actually the sense of their subject.
Frege took sense to be of no special concern so long as the description
chosen for a name gets its reference right, and classical descriptivists
have embraced Frege’s “tolerant” attitude.43 But this attitude is not
linguistically adequate, nor, for that matter, is it consistent with Frege’s
explicit appeal to Kantian analyticity in his argument for senses on the
basis of identity sentences.

A recent approach to the classical description theory based on the
actuality operator at first looks as if it avoids this problem by
rigidifying the reference of proper names directly with respect to their
reference in the actual world.44 The idea is to use a rigidified
description like ‘actually John Smith’ or ‘the John Smith in the actual
world’ to fix reference. But multiple bearerhood prevents rigidifying
the reference of names in this way. The problem is that there are too
many John Smiths in the actual world. Which one of them is the John





pay-offs. First, unlike Kripke’s account of names, PMT keeps the linguistic
possibility of multiple reference separate from the extralinguistic mechanism which
imposes the constraint that referentially equivocal types have contextually definite
tokens. Second, the description which determines the reference of a token in one
context does not automatically constrain other tokens of the same name type in
other contexts. Accordingly, PMT encounters none of the counterfactual
counterexamples that plague the classical description theory where the same description
determines the referent of all the tokens of a name type.

42 See “Names without Bearers,” pp. 17-23.

43 Translations from the Philosophical Writings of Gottlob Frege, p. 588* 

44 Frank Jackson, “Reference and Description Revisited,” in J. E. Tomberlin, ed.,
Mind, Language, and Ontology: Philosophical Perspectives, Volume 12 (Cambridge:
Blackwell, 1998), pp. 201-18.

Smith? In order to prevent the multiple bearers of a name like ‘John
Smith’ from raising this problem, it is necessary to adopt Kripke’s
proposal, but, as we have seen, the evidence about multiple
bearerhood in natural language rules it out.


III. IMPROPER NAMES

Contrary to both PMT and neo-Millianism, (T) is true. There are
names in English with senses containing substantive properties. Their
existence has at times been noted, but they have not received the
attention necessary for their significance to be properly understood.
This, I submit, is because the categories Proper Noun and Name have
been wrongly taken to be coextensive with the category Proper Name.
So, let us begin with a better classification.

Both proper and common nouns have senses, but senses of the
former contain the metalinguistic predicate ‘x is a bearer of ‘N’ ’,
while the senses of the latter do not. Proper nouns divide into proper
names, such as (16):


(16) Dartmouth, Aristotle, Socrates, Moses, Gödel, ...


and improper names, such as (17a) and (17b):


(17a) Jack the Ripper, Attila the Hun, Richard the Lion Hearted, Ivan
the Terrible, Dennis the Menace, Mandrake the Magician, ...
(17b) Superman, Batman, Spiderman, Plastic Man, Wonder Woman, ...


The senses of improper names, unlike the senses of proper names,
contain substantive properties in addition to the metalinguistic
predicate ‘x is a bearer of ‘N’ ’.

Consider some of the evidence in favor of (T). Improper names
contain meaningful syntactic constituents. For example, the definite
descriptions in (17a), ‘the Ripper’, ‘the Hun’, ‘the Lion Hearted’,
and so on, seem to be as straightforwardly meaningful as any other
modification. Indeed, the improper nouns in (17a) initially strike us
as frozen forms of compositionally meaningful modifications. We can
easily imagine that the definite description ‘the Terrible’ first
occurred as a modifier distinguishing that Tsar Ivan from the other tsars
of the same name.

Another form of evidence that senses of improper names contain
substantive properties is the recursive expandability of the definite
descriptions in improper nouns like those in (17a). Not only does the
sense of ‘Dennis’ compositionally combine with the sense of ‘the’ and
‘menace’ to form the sense of ‘Dennis the Menace’, but modification
is possible for the nouns within such improper names; for example,
we have forms like (18):


(18) Dennis the Elm Street Menace, Dennis the Elm Street and School
Yard Menace, ...


Such recursive expansions are compositionally meaningful at each
step, and each step differs in meaning from the previous one by just
the sense of the modifier added at that step.

Still further evidence for the meaningfulness of improper names
comes from the fact that, unlike proper names, they translate into a
foreign language. Thus, the proper name ‘Ludwig Wittgenstein’ and the
proper name ‘Heinrich Himmler’ are not rendered ‘Louis Wittgenstein’
and ‘Henry Himmler’ in English, but the improper name ‘Richard the
Lion Hearted’ is rendered ‘Richard Coeur de Lion’ in French.

The sense of the improper name ‘Superman’ contains a substantive
property something like man with superhuman powers (for example,
leaping tall buildings in a single bound and moving faster than a
speeding bullet). The sense of ‘Batman’ contains a substantive
property something like man who uses bat symbolism in his superhero role. The
sense of ‘Plastic Man’ contains a substantive property something like
man able to stretch his body into virtually any shape.

Having a different grammatical structure from the examples in
(17a), improper names like those in (17b) are not expandable
productively. Whereas ‘Ivan the Terrible’ is a phrase containing a proper
noun and a modifier, ‘Batman’ is a compound common noun like
‘blackboard’. The productivity in the latter cases is thus different. It
results in compound proper nouns, for example, ‘Batmobile’ with
the derivative meaning automobile designed with bat symbolism for Batman
to use in his superhero role.

To show that improper names like those in (17b) are not proper
names, let us imagine a superhero who uses gold symbolism in his
superhero role and who is, accordingly, called ‘Goldman’. The
meaningfulness of the improper name ‘Goldman’ is evident when we
compare it to the orthographically identical proper name ‘Goldman’.
Unlike the superhero, a bearer of the proper name ‘Goldman’ can be
a woman and have no special relation to gold. Further, the
meaningfulness of improper names is shown by the fact that the plural of the
improper name is ‘Goldmen’, as in (19a):


(19a) Three Goldmen came to the rescue in Goldmobiles.


while the plural of the proper name is ‘Goldmans’, as in (19b):


(19b) Three Goldmans came to visit us in Oldsmobiles.


That is to say, the occurrence of ‘man’ in the improper name, in
contrast to its occurrence in the proper name, undergoes the
customary pluralization from ‘man’ to ‘men’ with the customary shift in
meaning from one to many.45

The substantive properties in the sense of improper names
constrain their reference in essentially the same way as the substantive
properties in the sense of common nouns. Thus, in literal uses, the
substantive properties in the sense of an improper name express
necessary properties of its bearer. No one can literally be Superman
if he has only ordinary human powers. Nor can the superhero be a
female. Note that saying no woman can literally be Superman does
not say that the improper name cannot be used to refer to a woman.
Suppose the Kents mistakenly took Jar El, a female, to be male, and
raised her as a male. Realizing that she can be a more successful
superhero in the guise of a man, she maintains the fiction that she is
one. On this scenario, the Metropolis citizens’ use of ‘Superman’
(“Look in the sky: it’s Superman”) to refer to a woman is a case of
reference under a false description. In another kind of case, a
cowardly Richard commanded his subjects to call him ‘Richard the
Lion Hearted’. In still another, a harmless Ivan, due to bad press,
comes to be called ‘Ivan the Terrible’. Such cases are also reference
under a false description.

Because the senses of improper names involve substantive
properties, improper names, in contrast to proper names, have substantive
application conditions like common nouns. The proper name
‘Ebenezer Scrooge’ applies to Scrooge even after he is
frightened into becoming a model of Christian charity, but the improper
name ‘Dennis the Menace’ does not apply to Dennis after he
matures, changes his threatening ways, and becomes a harmless,
well-intentioned adult. Technically, people might still call him ‘Dennis
the Menace’, but only in a figurative way, in order, say, to remind
him, meanspiritedly, of what he had once been like.

Things are more complicated with a case like ‘Ivan the Terrible’. It
seems reasonable to say that Ivan the Terrible bears this name even in
old age when he is too feeble to strike fear in anyone. We might even
say of him in his dotage, “Ivan the Terrible is an old sweetie-pie.” This
shows that the assignment conditions for improper names do not
have the same temporal structure as their application conditions.
Putting this complication aside, the point is that, for Ivan to be Ivan
the Terrible, he must have earned his bad reputation at some period
during his life. Otherwise, the expression would be merely an ironic
form like ‘Li’l Abner’.


45 The last point is due to Armour-Garb (in conversation).

Given the existence of improper names, (T) is true. Thus, we must
restrict (1) to be just a schema for senses of proper names and then
add a separate schema for senses of improper names. Hence, (1'):


(1') (i) If ‘N’ is a proper name, its sense is the T which is a bearer of ‘N’.
(‘T’ is, in effect, the dummy term ‘thing’ in (1).)
(ii) If ‘N’ is an improper name, the sense of ‘N’ is the T* which is
a bearer of ‘N’, where ‘T*’ is a term which, unlike the dummy
term ‘T’, contains substantive properties.


In cases like (17a), the occurrence of T in the proper name
component of an improper name like ‘Dennis the Menace’ becomes an
occurrence of T* in the compositional formation of a sense of the
name: the sense of the appositional phrase fleshes out the dummy
term (see the above discussion of the Paderewski case). In cases like
(17b), an occurrence of T* is an inherent, that is, lexical, feature of
the improper name.

Kripke concedes that


There may be some cases where the description picture is true, where
some man really gives a name by going into the privacy of his room and
saying that the referent is to be the unique thing with certain identifying
properties. ‘Jack the Ripper’ was a possible example which I gave.46


Despite the fact that Kripke puts the concession as a possibility, it is
quite clear that, with the grammatical distinctions available in Naming
and Necessity, ‘Jack the Ripper’ is a proper name like any other, and,
as a consequence, the expression poses a counterexample to the
Millian view that names do not have a sense. For, at the very least,
‘Jack the Ripper’ has the sense causer of cuts or tears. Given that the
literal referent of tokens of ‘Jack the Ripper’ is semantically required
to be the causer of cuts or tears, Kripke must concede that “the
description picture is true” for some cases.

Kripke also writes as if this concession is necessary only for one or
another marginal case, as if the ‘Jack the Ripper’ case is the exception
and cases like (16) the rule. But the distinction here is not that between
exception and rule but that between attributive uses of a description to
fix the bearer of a name and referential uses (or ostension). The ‘Jack
the Ripper’ case can serve as a model for a general strategy of
introducing proper as well as improper names. Could not a proper name
‘Dartmouth’ have had its reference fixed in the same manner as the
improper name ‘Jack the Ripper’? Surely, the British authorities could
have gone into the privacy of their chambers and stipulated that the


46 Naming and Necessity, p. 94.

referent of ‘Dartmouth’ is the city, whichever it is, that is situated at the
mouth of the river Dart. If Dartmouth had been named in this way, then,
on Kripke’s own account, ‘Dartmouth’ would be synonymous with ‘the
city situated at the mouth of the river Dart’. Hence, this proper name
would have a sense.

Gareth Evans47 was the first, to my knowledge, to suggest that an
attributive use of a description in a baptism can provide a proper name
with substantive content. His example was the name ‘Julius’, introduced
by the stipulation that the name refers to the man, whoever it is, who
invented the Zip. Evans calls such names “descriptive names” because he
takes them to have substantive content and thinks “it is not very plausible
to deny that ‘Julius’ is a member of the same semantical category as
other proper names” (ibid., pp. 31-32). The existence of an open set of
cases of attributive uses of descriptions in baptisms begins to make it look
as if Kripke’s concession to “the description picture” is far more
damaging than he led us to think.

Evans’s ‘Julius’ case, my ‘Dartmouth’ case, and perhaps even
Kripke’s ‘Jack the Ripper’ case are hypothetical cases, but there is
reason to think that many actual cases of naming work the way they
do. The reason is that many situations will be like the one that is
imagined in the ‘Julius’ and ‘Jack the Ripper’ cases. There is a
generalization: these are cases where the baptizers have no direct
access to the object to undergo baptism, where their only access to it
is qua satisfier of a certain description. This is the case in many
theoretical baptisms, such as in Kripke’s ‘Neptune’ case, and also in
many mundane ones.48 Further, in some situations, the object does
not yet exist, as when an inventor has the idea for a product and a
marketing name but has yet to invent the bearer. Hence, it seems
reasonable to think that, in the course of history, there have been
many baptisms based on attributively used descriptions.49


47 The Varieties of Reference (New York: Oxford, 1982).

48 Naming and Necessity, pp. 79, 96.

49 Two things that Kripke says bear on this. He says, “First, the description used is not
synonymous with the name it introduces but rather fixes its reference” (Naming and
Necessity, p. 96, footnote 42). This is unhelpful because, while what Kripke says may be
true for perhaps the larger number of cases, the issue here is whether there are not also
a significant number of cases where the description used to refer to the baptizee
provides its name with a sense. Kripke also says, “Second, most cases of initial baptism
are far from those which originally inspired the description theory” (ibid.). To be sure,
this raises a problem for description theorists in cases of figures from the distant past
where the relevant descriptions are unknown to contemporary speakers. But the fact
that classical descriptivists have their own troubles is no solace to neo-Millians. For, even
if we do not know specifically in which cases the baptizer has employed a description
attributively, it is enough to know that there are proper names which are synonymous

But, on closer inspection, names like Evans’s ‘Julius’ turn out not
to be counterexamples to the core Millian thesis. Evans says that “it is
not very plausible to deny that ‘Julius’ is a member of the same
semantical category as other proper names,” but this is true only if
neo-Millians are limited to the grammatical distinctions in Naming
and Necessity. If they adopt the distinctions in our classification,
neo-Millians can argue that a critic like Evans is not entitled to say
anything stronger than that ‘Julius’ is a member of the category
Proper Noun. Hence, neo-Millians have a response to Evans-style
criticism: the proper noun ‘Julius’ is an abbreviation for an improper
name. As Evans presents the case, the term ‘Julius’ is introduced on
the basis of a speech act performed with some such sentence as “Let
us call whoever invented the Zip ‘Julius’.”50 The act requests us to use
‘Julius’ as short for the improper name ‘Julius the Inventor of the
Zip’. Since Evans’s term ‘Julius’ is not a proper name, but, in effect,
an improper name, it does not contradict the restricted Millian thesis
that proper names have no sense.51

Hence, the mistake which led to Kripke’s concession to “the
description picture” was similar to our mistake and is similarly
remedied. Neo-Millians should distinguish between proper names and
improper names, and restrict the Millian thesis to the former.
Nothing much is lost, since proper names were all along the only names of
real concern to them.52

It looks, however, as if something important is lost nonetheless,
namely, Kripke’s claim that certain a priori truths are not necessary


with a description for neo-Millians to have to concede that some proper names have a
sense.

50 The Varieties of Reference, p. 91.

51 The general strategy for the other cases in question is to model them on the
preceding construal of the ‘Julius’ case, that is, to treat the name in cases where an
attributive use of a description fixes the reference as an abbreviation of an improper
name. Proper names, for neo-Millians, are proper nouns where a referential use of
a description or an ostension in the baptismal ceremony fixes their reference.
Proper names, the neo-Millian can continue to claim, have no sense.

On this treatment, attributively used descriptions do not introduce the possibility
that a name like ‘London’ with multiple bearers is ambiguous. If the name
‘London’ has multiple bearers on the basis of attributive uses of n nonsynonymous
descriptions, then we have ill-chosen abbreviations of n improper names.

52 Even in the case of improper names, senses are not reference determiners.
There could be two Dennis the Menaces. Or, there could be two Supermen, that is,
two visitors from Krypton with the same superhuman powers, both of whom were
raised by a kindly old couple named ‘Kent’, and so on, and both of whom go into
the superhero business under the name ‘Superman’. Hence, the senses of ‘Dennis
the Menace’ and ‘Superman’ do not suffice to determine the definiteness of
reference required in a use of a name.

truths.53 To begin, note that truths expressed by sentences like ‘Julius
is the inventor of the Zip’ count as necessary truths because their
unabbreviated form is an analytic proposition like Julius the Inventor of
the Zip is the inventor of the Zip. Thus, if, as Kripke assumes in Naming
and Necessity, analytic propositions are instances of logical truths, he is
wrong to deny that an a priori statement like (20) is a necessary truth:


(20) Neptune [that is, the cause of such and such observed perturbations
of the orbits of such and such planets] is the cause of such
and such observed perturbations in the orbits of such and such
planets.


If (20) is analytic in the manner of ‘Julius is the inventor of the Zip’, (20)
is a necessary truth. Further, if Leverrier assigns the name ‘Neptune’ on
the basis of the description ‘the cause, whatever it is, of such and such
observed perturbations in the orbit of such and such planets’, it makes
no sense for Leverrier to think that “some other object might have
caused the perturbations in its place,” as Kripke imagines in defending
his claim about apriority and necessity. This is like thinking “Homer
didn’t write the Iliad and the Odyssey, it was some other ancient Greek by
the same name.” So, Kripke seems to have been wrong in claiming that
a priori truths like (20) are not necessary truths.

On the assumption that analyticity is what Fregean intensionalism says
it is, Kripke's claim is wrong. But if analyticity is what our non-Fregean
intensionalism says it is, his claim is right. Since our notion of analyticity
is not Frege’s logical notion but rather the mereological notion described
in section I, an analytic sentence like (4) does not express a
necessary truth in spite of the fact that we can know it is analytic a priori.
On our intensionalism, the analyticity of (4) permits us to say no more
than that our uses of it either make a true statement or no statement at
all. In our world, (4) makes a true statement, but the statement is
contingent because Socrates might not have been a bearer of ‘Socrates’.
Similarly, in our world, (20) makes a true statement, but the statement
is contingent because Neptune might not be the cause of such and such
perturbations in the orbits of such and such planets.54


IV AN ARGUMENT AGAINST MILLIANISM 
Although neo-Millians can still claim that proper names do not have
a sense, neither Mill's argument nor any of the Kripkean variations
can establish the claim. Such arguments are only arguments against
Fregean senses. Millian arguments can (and do) decide whether


53 Naming and Necessity, p. 79, footnote 33.
54 For, as Kripke points out, it could have “been knocked off its course one
million years earlier” (Naming and Necessity, p. 79, footnote 33).
proper names have Fregean senses, since that question falls within
the domain of reference. They cannot decide whether proper names
have senses, since this question falls within the domain of sense. I
shall now give an argument that decides the question.55

Neither Mill himself nor neo-Millians like Kripke, Putnam, and Keith
Donnellan is an extensionalist.56 Neo-Millians take common nouns,
verbs, adjectives, adverbs, and so on to be meaningful, and, presumably,
when presented with the evidence in the last section, they would agree,
as Kripke already does in connection with the ‘Jack the Ripper’ case, that
improper names, too, have a sense. Their denial that proper names have
sense is not a consequence of an objection to senses per se, such as
Quine's, but derives instead from what those philosophers take to be the
special character of proper names. Since neo-Millians are thus
intensionalist with respect to a large portion of the language, they require a
theory of sense for the language as a whole. Without one, their semantics
is incomplete in the manner of an account of gender that addresses
one gender but not the other.

A theory of sense for the language must meet the criteria for
linguistic theories. A linguistic theory, whether about phonology,
syntax, or sense, is justified on the basis of its success in explaining the
relevant properties and relations of sentences. Accordingly, ascriptions
of senses to expressions of the language must be based on the
role the ascriptions play in explaining the sense properties and
relations of sentences. If neo-Millians were to choose a theory of
sense that fails to satisfy reasonable criteria for linguistic theories,
their position would inherit the failure of the chosen theory of sense.


55 My earlier attempts to answer this question were, for the most part, based on the
strategy of providing a version of the description theory that escapes the problems of
both the classical version and Millian theories. This strategy raised various tangential
issues about semantic and pragmatic differences between the PMT and Millian theories
that have obscured the fundamental issue between them. Here my strategy is to show
that the issue is just Millianism and to argue directly against it.

56 Putnam (“The Meaning of ‘Meaning’,” in Keith Gunderson, ed., Language,
Mind, and Knowledge (Minneapolis: Minnesota UP, 1975), pp. 131-93) is up front
about being a Fregean intensionalist. His criticism of Frege is only that senses as
reference determiners leads to the externalist view that knowledge of meaning is
not entirely in the head. Donnellan (“Necessity and Criteria,” this JOURNAL, LIX, 22
(October 25, 1962): 647-58) is also an intensionalist, citing ‘cat’ and ‘red’ as
examples of terms with senses. Donnellan changed his view on natural-kind terms,
but that change does not entrain a change in view concerning color terms (‘red’),
artifact terms (for example, ‘knife’), institutional terms (for example, ‘bachelor’),
linguistic terms (for example, ‘anagram’), and mathematical terms (for example,
‘prime’). Though more cagey, Kripke (Naming and Necessity, pp. 127-28), too, is an
intensionalist, taking terms like ‘foolish’, ‘fat’, and ‘yellow’ as having senses.
A linguistic theory is in the business of explaining all the phenomena
in its domain, that is, of explaining the linguistic properties and
relations of all the expressions that have the appropriate linguistic
properties and relations. Thus, theories of sense can show no favoritism
with respect to explaining phenomena in the domain of sense.
Just as theories of syntax can show no partiality toward the explanation
of the syntactic properties and relations of one or another class
of expressions, so theories of sense can show no partiality toward the
explanation of the sense properties and relations of one or another
class. Hence, neo-Millians are committed to a theory of sense that
explains the sense properties and relations of all expressions and
sentences of the language having such properties and relations.

Thus, if sentences containing proper names have sense properties
and relations, the theory of sense to which neo-Millians are committed
has to explain their sense properties and relations. We have
already noted that some sentences containing proper names have the
sense property of being meaningful: (5c) is an example. We have also
noted that some sentences containing proper names are nonsynonymous:
(5a) and (5b) are examples. Again, we have noted that some
sentences containing proper names are analytic and some analytically
entail others: (4) is an example of analyticity, while (2) and (3)
provide an example of analytic entailment. Further, (21) and (22) are
examples of sentences that are ambiguous (between an analytic and
a synthetic sense; see footnote 11).


(21) Boys are bearers of ‘Boy’ 
(22) Smiths are bearers of ‘Smith’ 


Finally, (23) is synthetic and (24) contradictory.


(23) John Locke is a bearer of ‘Giovanni Locke’
(24) John Stuart Mill is not a bearer of ‘John Stuart Mill’


Compositionality enables us to drive the argument down from
sentences containing proper names to the proper names they contain.
An explanation of how it is that a sentence has the sense
properties and relations it has is an explanation of how it is that the
sentence has the sense(s) it has. For example, an explanation of a
sentence being meaningful is an explanation of its having at least one
sense, and an explanation of a sentence being ambiguous is an
explanation of its having two or more senses. Now, since meaning is
compositional, the explanation of a sentence’s having the sense(s) it
has (idioms notwithstanding) is an explanation of how those sense(s)
are formed compositionally from the senses of its lexical items and
its syntax. Thus, an explanation of the meaningfulness or ambiguity
of a sentence is an account of how its sense(s) are formed
compositionally from the senses of its lexical items.

If the proper name ‘Santa Claus’ in (5c) had no sense to contribute to
the compositional meaning of the sentence, (5c) would not have a sense
and would not be meaningful. If the proper names ‘Boy’ and ‘Smith’ in
(21) and (22) had no sense to contribute to the compositional meaning
of those sentences, these sentences would not have two senses and would
not be ambiguous. Hence, the fact that (5c) has a sense and that (21)
and (22) each have two senses means that their subjects, the proper
names ‘Santa Claus’, ‘Boy’, and ‘Smith’, have a sense. The story is the
same in the case of the explanation of the sense properties and relations
of (2), (3), (4), (23), and (24). Since compositionality commits a theory
of sense to ascribing senses to proper names in such sentences in order
to account for their sense properties and relations, those who are
committed to a theory of sense are committed to the descriptivist thesis that
proper names are meaningful.57

Thus, in acknowledging the meaningfulness of common nouns
and other expressions including improper names, neo-Millians commit
themselves to a theory of sense that explains sense properties and
relations of sentences. The fact that sentences containing proper
names have sense properties and relations means that the theory has
to explain such sentences, and, given the compositional nature of
explanation, its explanations must ascribe senses to the proper names
in sentences. Given compositionality, neo-Millians are committed to
a theory of sense that says proper names have a sense. Hence,
neo-Millians hold a philosophical theory that claims proper names
have no sense and a linguistic theory that claims the opposite.


57 The senses that a theory of sense commits us to saying that proper names have
must be instances of (1’)(1). Nothing less accounts for the contribution of a proper
name to the compositional explanation of the sense properties and relations of
sentences. Ascribing the sense ‘the thing which is a bearer of “Socrates”’ to the proper
name ‘Socrates’ explains why (4) is trifling, why its predicate merely repeats what its
subject already says. Ascribing a sense which is an instance of (1’)(1) to ‘John Locke’
explains the nonanalyticity of (23), since, on this ascription, the sense of (23) has the form
‘a bearer of “N” is a bearer of “M”’. And, ascribing a sense which is an instance of (1’)(1)
to ‘John Stuart Mill’ in (24) explains the contradictoriness of (24), since, on this
ascription, the sense of (24) has the form ‘a bearer of “N” is not a bearer of “N”’.
Correspondingly, nothing more than an instance of (1’)(1) would satisfactorily
represent the contribution of a proper name to the compositional meaning of a sentence,
since nothing more is needed to account for its sense properties and relations.
Accordingly, simplicity considerations limit ascriptions to instances of (1’)(1).
V EPILOGUE
Referring to some of the major arguments that proper names have a
sense, including Frege’s argument that sentences like (5a) are analytic
whereas sentences like (5b) are not, Kripke says:


It would be nice to answer all of these arguments. I am not entirely able
to see my way clear through every problem of this sort that can be
raised. Nevertheless, I think it’s pretty certain that the view of Frege
and Russell is false.58


Kripke was right to think that classical descriptivism is false, but
wrong to think it possible “to answer all of these arguments.”
Descriptivism being true, they are largely persuasive arguments for a
true conclusion. In attempting to defend Millianism with Millian
arguments, Kripke was trying to do the impossible. He was trying to
refute the descriptivist thesis that proper names have a sense with
arguments that work only against the Fregean thesis that they have a
reference-determining sense.
As we have seen, Kripke’s faith that Millian arguments work against
the descriptivist thesis rests on his equation of sense with Fregean sense
and intensionalism with Fregean intensionalism. Kripke is by no means
alone in equating them. Wittgenstein, Quine, and most contemporary
analytic philosophers also equate them. Being widespread, the equation
has had widespread consequences. Elsewhere, I have argued that it has
limited the approaches that philosophers have considered in many
controversies besides the controversy about names, and, further, that, in
those controversies, too, the failure to consider non-Fregean
intensionalism has prevented an intuitively satisfying resolution.59 If intuitively
satisfying resolutions of those other controversies also come from
introducing non-Fregean intensionalism into them, then the foundations of
analytic philosophy (the tradition which comes down to us from Frege,
Russell, Wittgenstein, and Quine and within which most Anglo-American
philosophers now work) are due for a much-needed re-examination.
JERROLD J. KATZ
Graduate Center/City University of New York 


58 Naming and Necessity, p. 29.

59 The Metaphysics of Meaning (Cambridge: MIT, 1990); “Analyticity, Necessity, and
the Epistemology of Semantics”; “The Problem in Twentieth-Century Philosophy,”
this JOURNAL, XCV, 11 (November 1998): 547-75; Sense, Reference, and Philosophy
SYSTEMATICITY AND THE COGNITION
OF STRUCTURED DOMAINS*


The current debate over systematicity concerns the formal
conditions a scheme of mental representation must satisfy in
order to explain the systematicity of thought.1 The systematicity
of thought is assumed to be a pervasive property of minds, and


* Versions of this article were presented before the Society for Philosophy and
Psychology (June 1999), CogSci99 (July 1999), and the American Philosophical
Association (April 2000) and at various universities. We thank audiences and
commentators for their helpful remarks. Research was funded by National Science
Foundation Grant # 9976739.

1 The debate was instigated by Jerry A. Fodor and Zenon W. Pylyshyn,
“Connectionism and Cognitive Architecture,” Cognition, XXVIII (1988): 3-71. In the present
article, we mainly focus on that paper and on Fodor and Brian P. McLaughlin,
“Connectionism and the Problem of Systematicity: Why Smolensky’s Solution
Doesn't Work,” Cognition, XXXV (1990): 183-205, and Paul Smolensky, Géraldine
Legendre, and Yoshiro Miyata, “Principles for an Integrated Connectionist/Symbolic
Theory of Higher Cognition” (Tech. Report 92-08, Institute of Cognitive
Science, University of Colorado). See footnote 2 for other papers detailing Fodor's
position and footnote 5 for papers detailing Smolensky’s. Other important
participants in the debate were Kenneth Aizawa, “Explaining Systematicity,” Mind and
Language, XII (1997): 115-36; Keith L. Butler, “Towards a Connectionist Cognitive
Architecture,” Mind and Language, VI (1991): 252-72; Butler, “Compositionality in
Cognitive Models: The Real Issue,” Philosophical Studies, LXXVII (1995): 153-62;
David J. Chalmers, “Connectionism and Compositionality: Why Fodor and Pylyshyn
Were Wrong,” Philosophical Psychology, VI (1993): 305-19; Nick Chater and Mike
Oaksford, “Autonomy, Implementation and Cognitive Architecture: A Reply to
Fodor and Pylyshyn,” Cognition, XXXIV (1990): 93-107; Robert Cummins,
“Systematicity,” this JOURNAL, XCIII, 12 (December 1996): 591-614; Robert F. Hadley,
“Systematicity in Connectionist Language Learning,” Mind and Language, IX (1994):
247-72; Hadley, “Systematicity Revisited,” Mind and Language, IX (1994): 431-44;
Hadley and Michael B. Hayward, “Strong Semantic Systematicity from Hebbian
Connectionist Learning,” Minds and Machines, VII (1997): 1-55; Hadley, “Cognition,
Systematicity, and Nomic Necessity,” Mind and Language, XII (1997): 137-53;
Terence Horgan and John Tienson, “Structured Representations in Connectionist
Systems?” in Steven Davis, ed., Connectionism: Theory and Practice (New York: Oxford,
can be characterized (roughly) as follows: anyone who can think T
can think systematic variants of T, where the systematic variants of T
are found by permuting T’s constituents. So, for example, it is an
alleged fact that anyone who can think the thought that John loves
Mary can think the thought that Mary loves John, where the latter
thought is a systematic variant of the former.
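
The rough characterization above can be illustrated with a minimal sketch. It assumes, purely for illustration, that a thought is encoded as a relation paired with a tuple of arguments; the function name systematic_variants and this encoding are hypothetical and are not drawn from the literature under discussion.

```python
from itertools import permutations


def systematic_variants(relation, arguments):
    """Return every thought obtained by permuting the argument constituents.

    A thought is encoded (hypothetically) as a (relation, arguments) pair;
    the relation is held fixed and only the arguments are reordered.
    """
    return sorted({(relation, order) for order in permutations(arguments)})


# The thought that John loves Mary, together with its systematic variants.
variants = systematic_variants("loves", ("John", "Mary"))
for relation, (first, second) in variants:
    print(f"{first} {relation} {second}")
```

On this encoding, the claim of systematicity is that a thinker capable of any one member of the returned list is capable of all of them.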


I BACKGROUND 

The systematicity of thought itself, of course, cannot be directly
observed. But it is easy to see why it is widely assumed. Anyone who
can understand a sentence S can understand its systematic variants.
Since understanding S requires having the thought it expresses,
anyone who can think the thought expressed by a sentence S can have
the thought expressed by the systematic variants of S. The systematicity
of all thought expressible in language seems to follow: if you can
understand ‘John loves Mary’, you must be capable of the thought
that John loves Mary. Since anyone who can understand ‘John loves
Mary’ can understand ‘Mary loves John’, it follows that anyone who
can have the thought that John loves Mary can have the thought that
Mary loves John.

To derive the thought one must have to understand the systematic
variants of ‘John loves Mary’, permute the words in ‘John loves Mary’
(while maintaining grammaticality), and preface the result with
‘that’. This leads to what we might call the orthodox position, which Jerry
A. Fodor, Zenon W. Pylyshyn, and Brian P. McLaughlin2 defend
concerning the explanation of systematicity, namely, that it is best
understood as involving two parts:


1991), pp. 195-228; Robert J. Matthews, “Three-concept Monte: Explanation,
Implementation, and Systematicity,” Synthese, CI (1994): 347-63; Matthews, “Can
Connectionists Explain Systematicity?” Mind and Language, XII (1997): 154-77;
Lars F. Niklasson and Tim van Gelder, “On Being Systematically Connectionist,”
Mind and Language, IX (1994): 288-302; van Gelder, “Compositionality: A
Connectionist Variation on a Classical Theme,” Cognitive Science, XIV (1990):
355-84.

2 Fodor, Pylyshyn, and McLaughlin’s position is fully laid out in the following
articles: Fodor and Pylyshyn, “Connectionism and Cognitive Architecture”; Fodor
and McLaughlin, “Connectionism and the Problem of Systematicity: Why
Smolensky’s Solution Doesn’t Work”; Fodor, “Connectionism and the Problem of
Systematicity (Continued): Why Smolensky's Solution Still Doesn't Work,” Cognition, LXII
(1997): 109-19; McLaughlin, “Systematicity, Conceptual Truth, and Evolution,” in
Christopher Hookway and Donald Peterson, eds., Philosophy and the Cognitive
Sciences, Royal Institute of Philosophy, Supplement no. 34 (New York: Cambridge,
1993), pp. 217-34; McLaughlin, “The Connectionism/Classicism Battle to Win
Souls,” Philosophical Studies, LXXI (1993): 163-90. Hereafter, ‘FPM’ is used in the text
to refer to the authors’ position, result, and the like; otherwise, their full names are
used.
(1) The representational theory of thought: having the thought that p is
having a p-expressing mental representation in a certain cognitive
role.


For example, having a belief that p amounts to having a mental
representation that p in the belief box.3


(2) Mental representation is “classical”: mental representation has a
language-like combinatorial syntax and associated semantics.4


Putting these two parts together, we get that anyone who can think
that John loves Mary can think that Mary loves John, since (i) thinking
‘Mary loves John’ involves tokening a representation of the proposition
that Mary loves John, and (ii) that representation has constituents
corresponding to Mary, John, and the relation of loving, which can
simply be permuted to yield a representation, and hence a thought,
corresponding to the proposition that John loves Mary. Fodor, Pylyshyn,
and McLaughlin thus conclude that the human system of
mental representation must be “classical,” that is, a language-like
scheme having the familiar kind of combinatorial syntax and associated
semantics first introduced by Alfred Tarski (op. cit.).

An unfortunate consequence of the way Fodor, Pylyshyn, and
McLaughlin have characterized the systematicity of thought is that
any theory that accounts for understanding every sentence will account
for systematicity trivially: if one can understand every sentence,
one can understand every systematic variant of any given sentence. If
the domain of sentences to be understood is finite, nonclassical
schemes could be complete in the relevant sense, and hence account
for systematicity. For instance, a look-up table that uses arbitrary
names to represent each sentence of the domain would account
for it.
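
The look-up table case can be made concrete with a short sketch. The four-sentence domain, the names s0, s1, and so on, and the helper functions are all assumptions introduced for illustration; the point is only that the names are atomic, sharing no constituents, while coverage of the finite domain is nevertheless complete.

```python
def build_lookup_table(sentences):
    """Give each sentence an arbitrary atomic name with no internal structure."""
    ordered = sorted(sentences)
    return {sentence: f"s{index}" for index, sentence in enumerate(ordered)}


def understands(table, sentence):
    """The scheme 'understands' a sentence just in case it has a name for it."""
    return sentence in table


# Hypothetical finite domain, closed under exchanging 'John' and 'Mary'.
DOMAIN = {
    "John loves Mary",
    "Mary loves John",
    "John sees Mary",
    "Mary sees John",
}
TABLE = build_lookup_table(DOMAIN)

# Every sentence, and hence every systematic variant in the domain, is covered,
# although the name of a variant bears no structural relation to the original.
for sentence in sorted(DOMAIN):
    print(sentence, "->", TABLE[sentence], understands(TABLE, sentence))
```

Because the table is complete for the domain, it satisfies the characterization of systematicity without any combinatorial syntax.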

A natural reply to this point would be to claim that only representational
schemes employing something like classical combinatorics
could be complete for an unbounded domain like the domain of
thought-expressing sentences in a natural language. But Paul Smolensky,
Géraldine Legendre, and Yoshiro Miyata5 have proven that for
every classical parser (that is, a parser defined over classical
representations) there exists a tensor-product network that is weakly
(input-output) equivalent to the classical parser but does not employ
classical representations.6 Thus, it appears that the classical explanation
and the tensor-product explanation explain the systematicity of
thought equally well.


3 This assumption of the representational theory of thought is controversial. One
might think that a person can have the thought that p without having the
representation that p. See Cummins, Representations, Targets, and Attitudes (Cambridge:
MIT, 1996).

4 We call these schemes “classical” in what follows (one could also call them
“Tarskian”). See Alfred Tarski, “The Concept of Truth in Formalized Languages”
(1936), in his Logic, Semantics, Metamathematics (New York: Oxford, 1956), pp.
152-278; “The Semantic Conception of Truth” (1944), in Herbert Feigl and Wilfrid
Sellars, eds., Readings in Philosophical Analysis (New York: Appleton, 1949), pp. 52-84.
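
The binding technique behind such networks, tensor-product encoding of fillers in roles, can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the recursive tree construction of Smolensky, Legendre, and Miyata: the role and filler vectors are hypothetical one-hot vectors, the verb is left out, and unbinding is exact only because the role vectors are orthonormal.

```python
def outer(role, filler):
    """Tensor (outer) product of a role vector and a filler vector."""
    return [[r * f for f in filler] for r in role]


def superimpose(a, b):
    """Add two bindings element by element to form one activation pattern."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def unbind(pattern, role):
    """Recover the filler bound to a role (exact for orthonormal roles)."""
    width = len(pattern[0])
    return [sum(role[i] * pattern[i][j] for i in range(len(role)))
            for j in range(width)]


# Hypothetical one-hot vectors chosen only for illustration.
ROLES = {"agent": [1.0, 0.0], "patient": [0.0, 1.0]}
FILLERS = {"John": [1.0, 0.0, 0.0], "Mary": [0.0, 1.0, 0.0]}


def encode(agent, patient):
    """Represent 'agent loves patient' as a sum of role/filler bindings."""
    return superimpose(outer(ROLES["agent"], FILLERS[agent]),
                       outer(ROLES["patient"], FILLERS[patient]))


john_loves_mary = encode("John", "Mary")
mary_loves_john = encode("Mary", "John")
print(unbind(john_loves_mary, ROLES["agent"]))
print(unbind(mary_loves_john, ROLES["agent"]))
```

The two sentences receive distinct activation patterns, and the filler of each role can be recovered from the pattern, yet neither pattern contains a token of ‘John’ or ‘Mary’ as a syntactic part.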

There are two philosophical responses to the SLM result that we want
to surface briefly and put aside. The first, due to Georg Schwarz,7 is
that networks like that designed by Smolensky, Legendre, and Miyata
do not really have an unbounded competence. The second, suggested
by Fodor, Pylyshyn, and McLaughlin, and discussed briefly by
Cummins,8 is that, while both parsers9 can account for the systematicity
data, the SLM explanation is unprincipled because the tensor-product
parser is handcrafted to achieve systematicity, whereas the
classical parser is not. We discuss these briefly in turn.


5 Smolensky, Legendre, and Miyata’s position is fully laid out in the following
articles: Smolensky, “The Constituent Structure of Connectionist Mental States,”
Southern Journal of Philosophy Supplement, XXVI (1987): 137-60; Smolensky, “Tensor
Product Variable Binding and the Representation of Symbolic Structures in
Connectionist Systems,” Artificial Intelligence, XLVI (1990): 159-216; Smolensky,
“Connectionism, Constituency and the Language of Thought,” in Barry Loewer and
Georges Rey, eds., Meaning in Mind: Fodor and His Critics (Cambridge: Blackwell,
1991), pp. 201-27; Smolensky, “Constituent Structure and Explanation in an
Integrated Connectionist/Symbolic Cognitive Architecture,” in Cynthia MacDonald
and Graham Macdonald, eds., Connectionism: Debates on Psychological Explanation
(Cambridge: Blackwell, 1995); Smolensky, Legendre, and Miyata, “Principles for an
Integrated Connectionist/Symbolic Theory of Higher Cognition.” Hereafter, ‘SLM’
is used in the text to refer to the authors’ position, result, and the like; otherwise,
their full names are used.

6 Tensor-product encoding is a general-purpose technique for binding fillers to
roles (objects to properties/relations). An activation vector representing the binding
of a filler f to a role R is obtained by taking the tensor product of the vectors
representing R and f. The resulting vector can then be added to (superimposed on)
others representing bindings of fillers to roles, yielding a single vector that
represents the binding of many fillers to many roles. Smolensky, Legendre, and Miyata
use this technique to construct recursive representations of binary trees. Matrices
effecting the vector operation for constructing a tree from its left child p and right
child q (or for decomposing it) are then defined, making possible direct
implementation in one layer of connection weights.

This connectionist representation of trees enables massively parallel processing.
Whereas in the traditional sequential implementation of LISP, symbol processing
consists of a long sequence of car, cdr, and cons operations, here we can compose
together the corresponding sequence of W_car, W_cdr, W_cons0, W_cons1 operations into
a single matrix operation. Adding some minimal nonlinearity allows us to compose
more complex operations incorporating the equivalent of conditional branching
(Smolensky, Legendre, and Miyata, ibid.). The matrices can then be
straightforwardly combined to yield a single layer of connection weights that implement
parse-dependent predicates and functions.

7 “Connectionism, Processing, Memory,” Connection Science, IV (1992): 207-26.

8 “Systematicity.”

9 Smolensky, Legendre, and Miyata do not in fact construct a parser. What they
do is show how to construct networks effecting parse-dependent functions and

Unbounded competences in connectionist networks. Schwarz has argued
that connectionist networks that are not simply implementations of
classical architectures cannot have unbounded competences. According
to Schwarz, we take an ordinary calculator to have an unbounded
competence because it employs perfectly general numerical algorithms
for addition, multiplication, subtraction, and division.
Consequently, its competence is limited only by time and memory.10 We
can add more memory to the calculator, and let it run longer, without
altering the algorithms it exploits. But, Schwarz argues, the same does
not hold of a connectionist calculator, for the only way to add
memory is to add more nodes, or to add precision to the nodes.
Doing either of these will require retraining the network, however,
which amounts to programming a new system that, in effect, executes
a different algorithm. The fact that processing and memory are fully
integrated in connectionist networks blocks the standard idealization
away from memory limitations that licenses attribution of unbounded
competences to classical systems whose controlling algorithms remain
unchanged with the addition or subtraction of memory.

We believe this argument should be rejected for reasons given by
Cummins.11 The culprit is the assumption that the function computed
is a function from input to output rather than a function from
input and initial state to output and final state. The input-to-output
relation exhibited by a classical parser, for example, is a function of
its stored knowledge. Change what the system knows and the same
input will yield a different output. Therefore, the input-output relation
is seldom a function. This is particularly obvious when we consider
learning algorithms. These cannot be conceived as functions
pairing inputs and outputs, for the whole point of a learning algorithm
is to replace ineffective input-output pairings with better ones.
By definition, learning algorithms remain constant over changes in
input-to-output relations.
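
A toy sketch may help fix the distinction between a function from inputs to outputs and a function from input and state to output and state. The functions respond and learn, and the stored pairing they operate on, are hypothetical and chosen only to show that the algorithms stay fixed while the input-to-output relation changes.

```python
def respond(stimulus, knowledge):
    """One fixed algorithm: the output depends on input AND stored knowledge."""
    return knowledge.get(stimulus, "unknown")


def learn(knowledge, stimulus, better_output):
    """One fixed learning rule: return a new state with the pairing replaced."""
    updated = dict(knowledge)
    updated[stimulus] = better_output
    return updated


# Hypothetical initial state containing an ineffective pairing.
initial_state = {"2+2": "5"}
later_state = learn(initial_state, "2+2", "4")

# Same input, same algorithms, different outputs under different states.
print(respond("2+2", initial_state))
print(respond("2+2", later_state))
```

Considered as a relation between stimuli and responses alone, the behavior is not a function, since ‘2+2’ is paired with two outputs; considered as a function of stimulus and state, nothing about it has changed.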





predicates (for example, active-passive) that correspond precisely to arbitrarily
complex parse-dependent LISP functions/predicates. In the context of the current
discussion, this amounts to showing that classical representation is not required for
systematic and productive capacities defined over the constituent structure of a
sentence.

10 It is also limited, ultimately, by wear and tear, but this raises no special problem
for connectionists.

11 “Systematicity.”
We accept the principle that a different function computed implies
a different algorithm executed. But the function computed by a
connectionist system must be conceived as a function from an activation
vector and point in weight space to another activation vector-weight
space pair. From this point of view, we do not build a new
network when we change weights any more than we build a new
rule-based system when we change its stored knowledge.

Connectionist data coverage is “unprincipled.” The idea here is that
classical representational schemes predict systematicity, whereas
connectionist schemes at best accommodate it.

To get a concrete sense of this objection, suppose a classical system
generates a phrase marker for ‘John loves Mary’. Since ‘Mary loves
John’ has precisely the same phrase marker, except that ‘John’ and
‘Mary’ have exchanged positions, the system is bound to be able to
parse ‘Mary loves John’ if it can parse ‘John loves Mary’: the
grammatical structure is the same, and so are the lexical resources.


Figure 1 [Phrase markers for ‘John loves Mary’ and ‘Mary loves John’: in each,
S branches into NP and VP; the two trees differ only in that ‘John’ and
‘Mary’ have exchanged positions. Tree diagrams not reproduced.]
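
The classical case can be sketched with a toy parser. The lexicon, the single subject-verb-object rule, and the simplified tree shape (which omits some nodes of the phrase markers in the figure) are assumptions made for illustration only.

```python
# Hypothetical lexicon mapping each word to its syntactic category.
LEXICON = {"John": "N", "Mary": "N", "loves": "V"}


def phrase_marker(sentence):
    """Build a phrase marker for an N-V-N string, or return None if it fails."""
    words = sentence.split()
    if [LEXICON.get(word) for word in words] != ["N", "V", "N"]:
        return None
    subject, verb, obj = words
    return ("S",
            ("NP", ("N", subject)),
            ("VP", ("V", verb), ("NP", ("N", obj))))


def skeleton(tree):
    """Replace each word by a placeholder, keeping only the category labels."""
    if isinstance(tree, str):
        return "_"
    return (tree[0],) + tuple(skeleton(child) for child in tree[1:])


first = phrase_marker("John loves Mary")
second = phrase_marker("Mary loves John")
print(skeleton(first) == skeleton(second))
print(phrase_marker("loves Mary John"))
```

Since one rule and one lexicon generate both parses, a system of this kind cannot parse the first sentence without thereby being able to parse the second, unless a special rule is added to block it.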


By contrast, so the argument goes, a connectionist network could
be trained to parse ‘John loves Mary’ but not ‘Mary loves John’.

An obvious reply is that while a classical system might be
programmed to exhibit the very same incapacity, the resulting system
would be obviously ad hoc. The incapacity does not appear ad hoc in
the connectionist framework, however, since the representations of
the sentences do not themselves have constituents corresponding to
the relevant lexical items. It might seem, therefore, that there is no
reason why the network would have to process the two representations
in a similar way. They are simply different activation vectors, and
hence there will be no special problem with a training set that simply
declares the second parse unacceptable. Bias against a systematic
variant of an acceptable sentence will present no learning problem
different in principle from bias against any other unacceptable
sentence, for example, ‘Loves Mary John’. A classical system, however,
will give both parses if it gives either, unless some special rule is
added to the grammar to block one or the other. There will be no
rule or process in the deviant connectionist system that is special to
this case. The same setting of weights that deals with every other
sentence, whether acceptable or unacceptable, will deal with the
deviation. There will, in short, be no principled distinction between
the way it rejects ‘Mary loves John’ and the way it rejects ‘Loves John
Mary’.

Here is a different but related way to see the alleged problem.
Imagine a classical system that can parse ‘John loves Mary’ but cannot
parse ‘John despises Mary’ because ‘despises’ is not in its lexicon. It
seems intuitively plausible to suppose that a simple learning algorithm
will add ‘despises’ to the lexicon, and that this will suffice to
yield the new parse. The SLM result shows that for each of these
classical parsers, there is a correlate, weakly equivalent, connectionist
parser. It is silent, however, about the relation between these two
correlate connectionist parsers. In particular, it is silent about
whether the correlate connectionist parser that handles ‘despises’ is
in any sense a natural extension of the one that does not. For all the
SLM result shows, the connectionist parser that handles ‘despises’
would have to be built from scratch and not a simple modification of
the one that does not. By itself, the SLM result does not show that the
networks corresponding to the pre- and postlearning stages lie on
any common connectionist learning trajectory. It shows only that any
set of parses that can be captured by a classical system can be
captured by a connectionist network. This at least suggests the
possibility that the connectionist network does not parse the systematic
variants of S because they are systematic variants of an acceptable sentence,
but rather that it parses them simply because they are among the
sentences in the target set defined by the classical system to be
emulated. We have, in short, no reason to think that all the systematic
variants of S are parsed in the same way. Rather, the prediction that
the network will parse a systematic variant of S is derived simply from
the facts (i) that the systematic variants of S are acceptable if S is, and
(ii) that the network parses all the acceptable sentences.12


12 This problem is an instance of a more general problem we address elsewhere
and will be pursued in further publications. See Pierre Poirier, Cummins, James
Blackmon, David Byrd, Martin Roth, and Schwarz, “The Epistemology of
Nonsymbolic Cognition: Atomistic Learning and Forgetting” (Tech Report Phil99-3,
University of California/Davis, 1999); see also Michael McCloskey and Neil J.
Cohen, “Catastrophic Interference in Connectionist Networks: The Sequential
Learning Problem,” The Psychology of Learning and Motivation, XXIV (1989): 109-65.
We shall come back in a later section to the merits (and demerits)
of the objection that connectionist data coverage is unprincipled. But
notice that, at this point, the debate has shifted from empirical
considerations of what best covers the data to philosophical
considerations of what constitutes a principled explanation. We propose,
therefore, to reformulate the issue in the hope of finding a way of
retaining its essentially empirical cast.


II THE ISSUE REFORMULATED

The fact that a system cognizes a domain will manifest itself in a
variety of psychological effects. By an effect, we mean a nomic
regularity in behavior (what Ruth Garrett Millikan13 calls a law in situ, that
is, a law that holds of a system in virtue of the special structure and
organization of that system). For instance, the fact that humans
cognize the color domain is manifested by the fact that humans make
such and such discriminations (and fail to make others), by the fact
that they will eat some foods and not others, and so on.

We take it as uncontroversial that some domains are cognized by
grasping their underlying structure. For example, we can recognize
the melody of “Mary Had a Little Lamb” whether it is played by an
orchestra or on a kazoo, regardless of what key it is played in, and,
within limits, regardless of tempo. Sensitivity to melody across
differences in timbre, tempo, and key suggests that we process a melody by
processing information about the structure and arrangement of
notes in a composition. Word problem-solving in algebra is another
case where grasping the underlying structure of a domain is required
for cognizing the domain. Students are better able to solve word
problems involving distances, rates, and times when they see the
problems as having the same structure as problems with which they
are already familiar. For example, students who can solve problems
involving wind speeds but have difficulty solving problems involving
current speeds are able to perform well on the latter when it is


Connectionist systems using distributed representation cannot learn atomically,
that is, they cannot add single representations, such as the predicate ‘despise’, to
their belief box or knowledge base without having to relearn everything they
previously learned. Surely, this is not the way we acquire new representations in
many domains. (But, for a possible solution, see James L. McClelland, Bruce L.
McNaughton, and Randall C. O'Reilly, “Why There Are Complementary Learning
Systems in the Hippocampus and the Neocortex: Insights from the Success and
Failures of Connectionist Models of Learning and Memory,” Psychological Review, CII
(1995): 419-57.)
13 Language, Thought, and Other Biological Categories (Cambridge: MIT, 1984).
pointed out that current is analogous to wind.14 Only when students
recognize an underlying structure with which they are familiar do
they seem able to give the correct answer (this suggests that sensitivity
to structure is learned in some cases, or at least affected by prior
knowledge). We also take it to be uncontroversial that some domains
are cognized without grasping any significant underlying structure:
knowing the capitals of twenty states in the United States is no help
in divining the capital of an unstudied state. Our ability to learn the
state capitals does not depend on being sensitive to underlying
structural features of states and their capitals. With respect to the
previous distinction, we call an effect a systematicity effect if it is a
psychological effect that implies sensitivity to the structure of the
domain cognized.15

Is it possible to draw any conclusions about the form of mental
representation from the presence of systematicity effects? We think it
is. Recall that our main methodological objective here is to keep the
issue at an empirical level in the hope that it can be solved there. Our
first conclusion proceeds from the observation that not all
systematicity effects are created equal: some may be computed by the system,
and some may be incidental effects of whatever algorithm is being
computed. We call the first type of systematicity effect primary and the
second incidental, and in the next section we address the distinction
and what it tells us about mental representation.16 Our second
conclusion rests on a finer analysis of the SLM tensor-product represen-


14 Denise Cummins, “Role of Analogical Reasoning in the Induction of Problem
Categories,” Journal of Experimental Psychology: Learning, Memory, and Cognition, v
(1992): 1103-24.

15 These examples are meant to represent the extreme cases. There are likely to
be cognitive tasks that require grasping varying degrees of structure depending on
the domain, so the distinction between structured and unstructured is not all or
nothing. Also, some cognitive tasks will be hybrid cases, where some aspects require
grasping structural information while others do not. Cognizing language appears to
be an example, since mastery of the primitive lexicon is surely more like learning
the state capitals than like mastering the syntax.

One must also be careful not to confuse structure in the domain with structure
in the way a problem or question is posed. When a teacher requires that I learn the
capital of California, the structure of my answer is determined in part by the
structure of the question itself. Similarly, one can imagine a state capital learning
device that represents states as one-place predicates such that only one capital could
be matched with any state. In this case the structure of the answer is partly
determined by the way states are represented in the system.

16 The distinction between primary and incidental systematicity effects is
introduced in Cummins, “Systematicity,” and also discussed in Cummins, “‘How Does It
Work?’ versus ‘What Are the Laws?’: Two Conceptions of Psychological
Explanation,” in Frank C. Keil and Robert A. Wilson, eds., Explanation and Cognition
(Cambridge: MIT, 2000), pp. 117-44.
tations. Unlike classical representations that preserve information
about the structure of represented elements by actually sharing that
structure, tensor-product representations do not share structure
with what they represent yet still manage to preserve structural
information and make it available to processors. This allows a possible
solution to a problem raised by Cummins17 against classical
representations (namely, that they cannot possibly share structure with
every domain in which we find systematicity effects), but the price for
that solution may be one Fodor, Pylyshyn, and McLaughlin are not
ready to pay. We address this issue in section IV.


III PRIMARY VERSUS INCIDENTAL SYSTEMATICITY EFFECTS 

The distinction between primary and incidental effects captures the
important fact that a system's behavior results not only from the
function it computes, but from a variety of other factors as well.
Compare a system that multiplies by partial products with a system
that computes products through successive addition. What the two
systems have in common is the multiplication effect, that is, the fact that
they both produce the same products from the same arguments.
They differ, however, in how long it takes them to compute the value
for a given argument. In the case of the successive adder, the
response time is roughly proportional to the size of the multiplier:
computing ten times some number will take approximately twice as
long as computing five times that number. The other system, in
contrast, does not display such a linearity effect. Its response time is,
roughly, a step function of the number of digits in the multiplier,
indicating how many partial products need to be added in the end.

The two systems have the same primary systematicity effects, since
every argument pair leads to the same value in both systems: they are
both multipliers. But their incidental systematicity effects are
different. For instance, the partial product multiplier will take
approximately the same amount of time to multiply N by 10 as it will to
multiply N by 99, whereas the successive addition multiplier will take
roughly 10 times longer to multiply N by 99 than it will to multiply N by
10. In general, two systems that compute the same function using
different algorithms will display different incidental effects, although
they will be weakly equivalent in Pylyshyn's sense.18
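
The contrast between the two multipliers can be made concrete with a small sketch. It is an illustration only, not a model drawn from the literature discussed here: the function names are hypothetical, and the step counts rest on the simplifying assumption that each addition, and each single-digit partial product, costs one step.

```python
def multiply_by_successive_addition(n, multiplier):
    """Return (product, steps), adding n to a running total `multiplier` times."""
    total, steps = 0, 0
    for _ in range(multiplier):
        total += n
        steps += 1  # assumption: one step per addition
    return total, steps


def multiply_by_partial_products(n, multiplier):
    """Return (product, steps), forming one partial product per multiplier digit."""
    total, steps = 0, 0
    for place, digit in enumerate(reversed(str(multiplier))):
        total += n * int(digit) * 10 ** place
        steps += 1  # assumption: one step per digit of the multiplier
    return total, steps


if __name__ == "__main__":
    for multiplier in (10, 99):
        print(multiplier,
              multiply_by_successive_addition(7, multiplier),
              multiply_by_partial_products(7, multiplier))
```

Both functions return the same products (the shared primary effect), while the step counts diverge: the successive adder's count grows with the size of the multiplier, whereas the partial-product count depends only on its number of digits.
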

One important source of incidental effects thus lies in the
algorithm producing the primary effects. But systems that implement the
same algorithm may exhibit different incidental effects. If the under-


17 “Systematicity.”
18 Computation and Cognition (Cambridge: MIT, 1984).
lying hardware is sufficiently different, they may operate at greatly
different speeds, as is familiar to anyone who has run the same
program on an Intel 486 running at 50 MHz and on a Pentium III
running at 750 MHz. A significant difference in response time
between two systems, in other words, does not entail that they
implement different algorithms; it may be a direct result of a difference at
the implementation level. Hence, two systems that have identical
primary effects may exhibit different incidental effects either because
they compute different algorithms or because the identical
algorithms they compute are implemented in different hardware.
Incidental effects are not restricted to the complexity profiles of
the algorithm or the constraints imposed by the implementing matter
on the execution of the algorithm. Continuous operation will heat up
the calculator, an incidental effect that is normally irrelevant to the
system's performance. Nonetheless, when things get too hot, the
system will start malfunctioning and ultimately break down. As a
consequence, two systems that differ in performance do not
necessarily compute different functions; one of the systems may simply
have been subject to extraneous factors, such as overheating, fatigue,
or attention deficit disorder. By the same token, the general physical
makeup of a system also determines the extent to which
environmental conditions will have an impact on the system's ability to operate.
The occurrence of a strong magnetic field, for example, will interfere
with the operation of an electronic calculator but not with that of an
abacus. In the case of single-purpose machines, this last kind of
incidental effect is primarily relevant for explaining why a system fails
to display its primary effects on certain occasions. But in the case of
more complex systems, such effects have proven useful as an
explanatory tool as well. Consider the fact that cognitive processing in the
human brain is generally correlated with increased metabolic activity,
a fact that has proved critical for the use of imaging technology (for
example, PET, fMRI) that studies which cortical areas are involved in
the processing of these tasks. Yet metabolic activity as such is not
specific to the neural implementation of whatever computations are
performed.

It is evident that the effects of implementation details and
environment complicate inferences from sameness of incidental effects to
sameness of underlying functional architecture. In what follows,
however, we shall make use only of the inference from differences in
incidental effects to differences in functional architecture in cases in
which implementational and environmental influences are not at
issue.

Now, Smolensky, Legendre, and Miyata have proven that a
connectionist tensor-product parser and the classical parser can exhibit
the same primary effects (they are weakly equivalent, like our two
multiplication algorithms). Any parse a classical parser computes can
be computed by a corresponding connectionist parser. If
systematicity effects are primary effects, then Smolensky, Legendre, and Miyata
have demonstrated, mathematically, that systematicity effects in
language parsing can be accounted for without any appeal to classical
representations. Hence, if the systematicity effects at issue are primary
effects, then nothing can be concluded concerning whether the
mental representations involved are classical or connectionist in
form. As we saw, the consequence of that empirical deadlock has
been to turn away from empirical considerations to philosophical
issues concerning what constitutes a principled explanation. The
SLM tensor-product parser employs a different algorithm than the
classical parser, however, and thus the two systems are bound to
exhibit different incidental effects. If the systematicity effects
observed are incidental, then, barring cosmic coincidence, at most one
of the two parsers can account for them. In particular, the classical
explanation can gain leverage over the SLM explanation if the form
in which information is represented is important to explaining the
effect. An incidental effect of using classical representations in
language processing might, for instance, involve priming. Since the
lexical items and sentential structure of a recently processed sentence
S may be easily accessible (they may still be in short-term memory, for
instance), the recent processing of S may allow the system to process
systematic variants of S faster than it would process sentences that are
not systematic variants of S. An incidental effect of using
connectionist representations is the flat temporal profile of parsing with a
tensor-product parser. Since the latter will process all sentences, no
matter how complex, in one step, it will process all of them in exactly
the same amount of time. Should the empirical evidence show that
subjects exhibit the kind of priming effect described above or a
temporal profile different from the one we would expect from the
use of a tensor-product parser, then, weak equivalence
notwithstanding, Fodor, Pylyshyn, and McLaughlin have a good argument in favor
of their claim that classical representations better explain the
systematicity of thought. Of course, should the evidence show that the
incidental effects are those one would expect from an SLM-type
parser, then it would seem that classical representations are not the
source of the systematicity of thought.

Fodor, Pylyshyn, and McLaughlin and their supporters cite no
evidence from incidental systematicity effects, nor do their
opponents. It seems likely that there are relevant effects reported in the
literature on language processing, but they have not, to our
knowledge, been brought to bear on this issue.19 It should be emphasized,
however, that systematicity effects of the sort Fodor, Pylyshyn, and
McLaughlin had in mind are clearly not incidental but primary:20
they do not argue from evidence that processing ‘Mary loves John’
makes it easier to process ‘John loves Mary’ to the conclusion that
mental representation is classical. They simply argue from the
availability of a given thought (sentence understood) to the possibility of
its systematic variants.


IV REPRESENTATIONAL PLURALISM AND STRUCTURAL ENCODINGS 
The inference from systematicity effects in language to classical
representations involves three steps:


(1) The observation of systematicity effects in language.

(2) An inference from the presence of these effects to the conclusion
that mental representations must preserve and carry information
about the structure of the domain.

(3) An inference from that conclusion to the further conclusion that


19 We are in the process of searching the literature for such evidence. Meanwhile,
the fact that the SLM parser accomplishes all parses in a single step, regardless of
the complexity of the sentence parsed, is surely suggestive.

20 This is put misleadingly in Cummins, “Systematicity,” p. 603. The point is that
proponents of the systematicity argument are thinking of systematicity effects as
primary when they should be thinking of them as incidental.
the information about structure must be carried by classical
representations.


Most will readily agree that systematicity effects in language can be
observed. We noted in the previous section that care should be taken
to distinguish primary from incidental systematicity effects in
language, since only the latter will allow conclusions to be drawn about
the nature of representations. Step 2 is also uncontroversial. In
unbounded domains (see above), how else can these effects be
produced? It is step 3 of the inference, from the preservation of
structural information to the necessary presence of classical
representations, that we wish to address here.

Sensitivity to the structure of a domain is best explained by a
scheme of mental representation that carries information about the
structure of that domain. One such scheme involves structural
representations of the domain in question. A representation R is a
structural representation of its target just in case R and its target share
structure. Scale models, photographs, and maps are typical examples
of structural representations. As Cummins21 points out, it is natural to
explain the systematicity effects we find in language, vision, and
audition by positing structural representations of each of the
respective domains. The obvious virtue of structural representation is that
structural transformations and permutations of the representations
yield representations of the systematic variants of the corresponding
targets in the cognized domain. For every operation defined over
elements in a domain there can be a corresponding operation
defined over elements in the representational scheme. Thus, the
representation of an item in the domain can be used to construct a
representation of its systematic variants. The sensitivity to structure
that is required to cognize certain domains is accomplished by
actually having the structure of the domain in the representations. We are
now in a position to appreciate that the FPM classical representations
are a case of a scheme of structural representation for the linguistic
domain. The idea is that, in order to be sensitive to the combinatorial
syntax and associated semantics of a language, there must be a system
of internal representations that has the same (or corresponding)
syntactic and semantic features.

This way of explaining systematicity effects, however, limits classical
schemes to the explanation of systematicity effects exhibited by the
cognition of language-like domains, since these are the only domains
that classical schemes can structurally represent. This leaves the
systematicity effects we find in other differently structured domains to
be explained by appeal to nonclassical schemes. Since we apparently
grasp the underlying structure of domains structurally distinct from
language, we would require structurally distinct representational
schemes to cope with each of them in the same way that classical
schemes are supposed to facilitate coping with language. Apparently,
structural representation comes at the price of (perhaps massive)
representational pluralism.

The problem of representational pluralism was first noticed by
Cummins (ibid.). Roughly, representational pluralism is the idea that
for every differently structured domain for which we show a
systematicity effect, we employ a scheme of mental representation that shares
structure with that domain. Thus, if we exhibit systematicity effects in
cognizing three differently structured domains, we employ at least
three differently structured schemes of mental representation. Fodor,
Pylyshyn, and McLaughlin can make a plausible argument from
systematicity effects in language processing to the conclusion that
some mental representation is classical. There are other systematicity
effects, however, such as those found in vision and audition, that
cannot be accounted for by structural representations if all mental
representation is classical, since the structures of the domains in
language, audition, and vision are different. Since the inference from
systematicity effects in domains other than language to nonclassical,
structural representations is on a par with the inference from
systematicity effects in language to classical representations, the
systematicities in vision and audition are good evidence for some nonclassical,
structural representations. Thus, if the FPM inference is sound, it
constitutes a good argument for representational pluralism. The only
way Fodor, Pylyshyn, and McLaughlin can retain the idea that mental
representation is monistic is by allowing that some systematicity
effects can be adequately explained without recourse to a
corresponding scheme of structural representations. Some systematicity will have
to be explained by appeal to mere encodings. An encoding of a
domain D is a mapping of the members of D onto the representations
in a scheme R whose members do not share the structure of their
images in D. Classical representations structurally represent linguistic
structure, but they only encode the structure of music. By allowing
some encoding, however, friends of Fodor, Pylyshyn, and McLaughlin
would forfeit the objection that connectionist data coverage is
unprincipled, since encoding, on their view, forces us to be
unprincipled somewhere. So it seems that either the objection must go, or
Fodor, Pylyshyn, and McLaughlin are forced to accept
representational pluralism.

The SLM parser shows, however, that representational monism is
compatible with an adequate account of primary systematicity effects.
To see this clearly, we need a taxonomy of representational schemes
that (i) makes evident what property holds of structural
representations in virtue of which they account for primary systematicity effects,
and (ii) identifies an alternative sort of scheme that can be seen to
have this critical property.

Our taxonomy relies on two distinctions. The first distinction has to
do with whether a representational scheme represents its domain by
making available representations that share structure with the items
of the domain represented. Shared structure involves two things.
First, the representations have constituents that represent the
constituents of the domain. Second, these representational constituents
are structurally related in a way that represents the ways in which the
constituents of the content are structurally related. We call any
scheme in which representations represent items in its target domain
in virtue of sharing structure with the things they represent a
structural representational scheme for that domain. We call schemes that
represent a domain, but not in virtue of shared structure, an encoding
of the domain. So the first distinction separates entities that represent
in virtue of shared structure, that is, structural representations, from
those which represent by some other means, that is, encodings.

The second distinction we need requires the notion of a recovery
function. A recovery function is a function from a representational
scheme to a domain. Intuitively, these functions allow one to recover
contents from representational schemes. Recovery functions are not
necessarily employed by cognitive systems; they may remain purely
theoretical constructs with which we may interpret representational
schemes. We distinguish between recovery functions which are
systematic and those which are not. A recovery function is systematic if
there is a general/productive algorithm for mapping the scheme to
its contents. In such a case, the function does not need to be defined
by a list of pairs each containing an element of the scheme and a
content. If the recovery function is systematic, then the
representational scheme is structural. If the recovery function is not systematic,
then the representational scheme is pure or arbitrary.
This taxonomy allows us to distinguish three categories of
representational entities: structural representations, structural encodings,
and pure encodings. Schemes that belong to either of the first two
categories are such that the information about the structure of the
domain is systematically recoverable. This is relevant because, if
structure can be recovered systematically from a scheme, the scheme
has preserved information about the structure of the domain. The
possibility of systematic recovery implies the preservation of structure.

One obvious way of preserving structure is for the representational
scheme simply to share the structure of the domain. This is what
structural representations do. But this is not the only way. Structural
encodings preserve structure without sharing it. Gödel numbering
and tensor-product schemes are both examples of structural
encodings. In the Gödel scheme for encoding sentences, words are assigned
natural numbers while their positions in the sentence are assigned
prime numbers in ascending order. A number m in position n yields
the number n^m. We can say that n^m stands for the word numbered m
standing in the place numbered n. This number is uniquely
factorable into n, m times. The Gödel string of a sentence is determined by
multiplying all these uniquely factorable numbers together. This
yields a number, expressed by a Gödel string, which is uniquely
factorable into a list of numbers. An example of a Gödel string is
‘1,093,500’, which factors into 2^2 × 3^7 × 5^3, which breaks down further
into a list of 2s, 3s, and 5s. The values of these numbers have been
assigned to places, and the number of occurrences of any number has
been assigned to words. We may suppose the string encodes the
sentence ‘Mary loves John’, where ‘Mary’, ‘loves’, and ‘John’ are
assigned the numbers 2, 7, and 3, respectively. The Gödel string itself
does not share structure with its content; however, this structure is
systematically recoverable, and thus the representation has preserved
information about the structure of the sentence. The SLM
tensor-product representations have the same property. The tensor-product
activation patterns themselves do not share structure with their
contents, but the existence of recoverability functions for such schemes
entails that such schemes preserve structure.
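
The Gödel scheme just described can be sketched in a few lines. This is a minimal illustration, assuming the word codes given in the text (‘Mary’ = 2, ‘loves’ = 7, ‘John’ = 3); the names `LEXICON`, `first_primes`, `encode`, and `decode` are hypothetical, and `decode` plays the role of a systematic recovery function.

```python
LEXICON = {"Mary": 2, "loves": 7, "John": 3}  # word codes taken from the text


def first_primes(count):
    """Return the first `count` primes in ascending order (trial division)."""
    primes, candidate = [], 2
    while len(primes) < count:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def encode(sentence, lexicon=LEXICON):
    """Multiply together prime(position) ** code(word) for each word."""
    words = sentence.split()
    number = 1
    for prime, word in zip(first_primes(len(words)), words):
        number *= prime ** lexicon[word]
    return number


def decode(number, lexicon=LEXICON):
    """Recover the sentence by reading off the exponent of each successive prime."""
    by_code = {code: word for word, code in lexicon.items()}
    words, position = [], 0
    while number > 1:
        position += 1
        prime = first_primes(position)[-1]
        exponent = 0
        while number % prime == 0:
            number //= prime
            exponent += 1
        words.append(by_code[exponent])  # raises KeyError on a malformed number
    return " ".join(words)


if __name__ == "__main__":
    print(encode("Mary loves John"))  # 1093500
    print(decode(437400))             # John loves Mary
```

The integer 1093500 has no constituents corresponding to the words, yet `decode` recovers word order by a general procedure rather than by a list of pairs, which is what makes the scheme a structural encoding in the sense defined above.
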

Structural encodings permit recovery but do not share structure
with their domains. Pure encodings do not permit systematic recovery
and so do not preserve structure. If, for example, we were to take the
names of the state capitals as encodings of the names of the states, we
would be employing a pure encoding scheme, for there is no
systematic way by which the names of the states could be recovered from the
names of the capitals.22 In this case, a look-up table is required, and
no information about structure is preserved.23
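
By way of contrast with a systematic recovery function, a pure encoding can only be inverted by listing pairs. The following sketch is illustrative only; the table `CAPITAL_TO_STATE` and the function `recover_state` are hypothetical names, and the table holds just three sample entries.

```python
# A look-up table: each pair must be listed individually.
CAPITAL_TO_STATE = {
    "Sacramento": "California",
    "Austin": "Texas",
    "Albany": "New York",
}


def recover_state(capital, table=CAPITAL_TO_STATE):
    """Return the state for a listed capital, or None if the pair was never stored."""
    return table.get(capital)


if __name__ == "__main__":
    print(recover_state("Sacramento"))
    print(recover_state("Olympia"))  # not in the table, so nothing can be inferred
```

No amount of inspection of the stored entries yields the state for an unlisted capital; recovery extends only as far as the list does.
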

Structural encodings, as opposed to pure encodings, are adequate
to account for primary systematicity effects because, under such
schemes, information about structure is preserved and that is all that
is required. This is, in fact, the property of structural encodings and
structural representations in virtue of which it is possible to construct
architectures using either one that exhibit systematicity effects. To
take our previous example, any system capable of processing
1,093,500 (the Gödel string for ‘Mary loves John’) is also capable of
processing 437,400 (the Gödel string for ‘John loves Mary’). The
parallel with the previous symmetric phrase markers is perfect. The
SLM tensor-product scheme is a structural encoding, not a pure
encoding as Fodor, Pylyshyn, and McLaughlin seem to assume, and
this is how Smolensky, Legendre, and Miyata are able to prove their
otherwise puzzling result. It is only when cognitive systems employ
such structure-preserving schemes that they can be causally sensitive
to the structure of the domain cognized and thus exhibit systematicity
effects. Systems employing pure encodings cannot exhibit such
effects because there is no common form or structure to which the
system could be causally sensitive.

Once we see that structural encoding is adequate to account for
systematicity effects (at least the primary ones), we are free to cleave
to representational monism. Since a tensor-product scheme like that
employed by Smolensky, Legendre, and Miyata need not share
structure with the domains it represents in order to account for primary
systematicity effects, such a scheme could, in principle, account for
primary systematicity effects in a variety of structurally distinct
domains. The same point can be made about classical representation:
since it can be used structurally to encode domains such as music that
are structurally unlike language, a classical scheme could account for
primary systematicity effects in nonlinguistic domains. Since classical
and nonclassical structural encodings are evidently on equal footing
in this respect, we conclude again that there is no sound argument
from primary systematicity effects in language to classical
representation. There is, of course, a sound argument from systematicity
effects against pure encoding, but that should come as no surprise.
What is important to systematicity is preserving information about the
relevant structure of the target domain. It does not much matter how


?? Another way to see the same point given à new capital name, there 1s no way 
to infer what state name it corresponds to 
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that information 1s encoded, provided an architecture exists that 1s 
capable of exploiting the information in that encoding Smolensky, 
Legendre, and Miyata demonstrate that connectionist networks can 
exploit such information as encoded ın tensor-product activation 
vector schemes 
The staunch defender of classical representations may object at this
point that there still is a methodological virtue in accepting classical
representations-plus-pluralism over encodings-plus-monism, because
it is the lawfulness of systematicity that needs to be explained and
encodings just do not account for it. But it is not clear that they do not.
We agree that, with pure encodings, there could be minds that
encode ‘Mary loves John’ without encoding ‘John loves Mary’, since
the ability to encode the former does not imply the ability to encode
the latter. With structural encodings such as the SLM tensor-product
scheme, however, matters are different. Since the encodings are
generated recursively from the filler and role vectors, the ability to
encode ‘Mary loves John’ does imply the ability to encode ‘John loves
Mary’, since the two encodings employ the same filler and role
vectors. The only difference between them is how the vectors get
multiplied and added to create the respective tensor-product
encodings of the two sentences.
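
A minimal sketch may make the last point concrete. It is not the SLM scheme itself, which is generated recursively over structured representations; it assumes a flat set of three roles, and the vectors, role names, and function names below are all hypothetical. Each sentence is encoded as a sum of filler-role outer products, and both sentences draw on the same filler and role vectors.

```python
# Hypothetical filler (word) and role (position) vectors; the particular
# numbers are illustrative only and do not come from the source.
FILLERS = {
    "Mary": [1.0, 0.0, 2.0],
    "loves": [0.0, 1.0, 1.0],
    "John": [3.0, 1.0, 0.0],
}
ROLES = {
    "subject": [1.0, 0.0, 0.0],
    "verb": [0.0, 1.0, 0.0],
    "object": [0.0, 0.0, 1.0],
}


def outer(filler, role):
    """Tensor (outer) product of a filler vector and a role vector."""
    return [[f * r for r in role] for f in filler]


def add(a, b):
    """Elementwise sum of two matrices of the same shape."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def encode(bindings):
    """Sum the outer products of each filler with the role it occupies."""
    total = [[0.0] * 3 for _ in range(3)]
    for role, filler in bindings.items():
        total = add(total, outer(FILLERS[filler], ROLES[role]))
    return total


def unbind(encoding, role):
    """Recover the filler bound to a role (valid here because the assumed
    role vectors are orthonormal)."""
    r = ROLES[role]
    return [sum(x * y for x, y in zip(row, r)) for row in encoding]


mary_loves_john = encode({"subject": "Mary", "verb": "loves", "object": "John"})
john_loves_mary = encode({"subject": "John", "verb": "loves", "object": "Mary"})
print(unbind(mary_loves_john, "subject"))  # Mary's filler vector
print(unbind(john_loves_mary, "subject"))  # John's filler vector
```

In this toy setting, any system that has the three fillers, the three roles, and the multiply-and-add operations needed for one sentence has everything needed for the other; the two encodings differ only in which filler is multiplied with which role.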
ROBERT CUMMINS, JAMES BLACKMON, DAVID BYRD, 
PIERRE POIRIER, MARTIN ROTH, GEORG SCHWARZ 
University of California/Davis


PERSONAL IDENTITY, MEMORY, AND CIRCULARITY
AN ALTERNATIVE FOR Q-MEMORY*


I have many memories of my last summer’s vacation, which I spent
in the Rocky Mountains. For instance, I can recall vividly seeing
a large grizzly bear and her two cubs at a specific time and place
(and distance). Since I was not used to wild animals larger than
ducks, the event made a lasting impression on me.
Let us assume that, as a matter of objective fact, some person did
see that bear at that precise time and place. Since I do not think I am
deluded in remembering having seen that bear at that moment, that
fact inclines me to believe that that person was me. This belief seems
justified, but it can be justified for different reasons. It may be the
case that, under normal circumstances, memory is reliable evidence
for the identity of my present self and the bear-seeing person,
however this identity is in fact constituted. But it may also be the case that
memory itself coconstitutes identity over time, that is, that part of
what it is or means for two time-indexed person-stages to be stages of
one person is that the later stage has conscious access to the earlier
stage via memory. In other words, memory can be either indicative or
constitutive of personal identity over time.
The latter option is generally considered to be implied by a
psychological criterion of personal identity over time. Contemporary
philosophers¹ who defend such a criterion hold (1) that personal
identity over time can be analyzed in terms of psychological
continuity, and (2) that psychological continuity is coconstituted by memory.
To be sure, memory is not thought to exhaust psychological
continuity, but neither is it considered to be an optional ingredient. In
fact, contemporary conceptions of psychological continuity evolved
by adding various kinds of psychological connections between
person-stages (connections such as those established by perpetuated
beliefs, values, and character traits or by relations between intentions
and actions) to John Locke’s² memory criterion of identity; this
criterion is held to be too tight, but correct in spirit.


* Thanks to Marya Schechtman, Jan Bransen, Stefaan Cuypers, Joel Anderson,
and Michael Smith for helpful comments. Work on it was sponsored by the Dutch
Organization for Scientific Research (NWO).

1 For example, Derek Parfit, Reasons and Persons (New York: Oxford, 1984);
Sydney Shoemaker, “Persons and Their Pasts,” in American Philosophical Quarterly,
VII, 4 (1970): 269-85; Shoemaker (with R. G. Swinburne), Personal Identity (New
York: Blackwell, 1984); Harold Noonan, Personal Identity (New York: Routledge,
1989); John Perry, “Personal Identity, Memory, and the Problem of Circularity,” in
Perry, ed., Personal Identity (Berkeley: California UP, 1975), pp. 135-55; David Lewis,
“Survival and Identity,” in Amelie Oksenberg Rorty, ed., The Identities of Persons
(Berkeley: California UP, 1976), pp. 17-40.


0022-362X/01/9804/186-214 © 2001 The Journal of Philosophy, Inc.

Since memory is considered a (the) main ingredient of psychological
continuity, one of the major objections against a psychological
criterion of personal identity is the contention that memory
presupposes personal identity. This contention, the so-called circularity
objection, is my topic here.

I shall argue that a psychological criterion of personal identity can
be rescued from the circularity objection only by accepting what has
not even seemed to be an option to philosophers so far: that memory
is only indicative rather than constitutive of psychological continuity
and hence, by a psychological criterion, of personal identity.

In order to do this, I shall claim that the traditional solution to the
objection, Sydney Shoemaker’s Q-memory (section I), is seriously
incomplete (section II). Next, I shall show that there is a kind of
psychological continuity that is not (co-)constituted by memory and
does not presuppose personal identity (section III). Memory, I shall
argue, presupposes (and is hence indicative of) this kind of
continuity (section IV). By analyzing personal identity in terms of this
alternative kind of psychological continuity, the circularity objection
can be avoided (section V). The new solution to the circularity
objection I shall propose accepts and explains the intuition that
memory presupposes personal identity (memory presupposes the
alternative kind of psychological continuity, which, in turn, analyzes
personal identity) without letting this stand in the way of a
psychological criterion of personal identity.


I. THE CIRCULARITY OBJECTION AND Q-MEMORY
The circularity objection was originally formulated by Joseph Butler³
to refute Locke’s (purported) memory criterion of personal identity.
Although it is likely that Butler’s point was merely to argue against the
conflation of epistemic and metaphysical criteria of identity,⁴ the
objection has been interpreted in far more sophisticated ways since
then.⁵ One way of presenting the sophisticated version of the
objection is by means of the distinction between veridical and nonveridical
memory.


2 An Essay Concerning Human Understanding [1670], A. D. Woozley, ed. (New York:
Meridian, 1974). Thomas Reid was one of the first to interpret Locke as defending
a memory criterion, but his interpretation is not uncontested; see, for example,
Marya Schechtman, The Constitution of Selves (Ithaca: Cornell, 1996), pp. 27-30.

3 “Of Personal Identity” [1736], in Perry, ed., pp. 99-105.

4 Noonan, p. 68.

5 See, for example, Parfit, p. 220; Shoemaker, p. 281.

It is possible for us to have nonveridical memories as a
consequence of, say, delusion, hypnosis, or having our brains tampered
with. We are capable of seemingly remembering events we never
witnessed. Obviously, such nonveridical memories have nothing to
do with personal identity; I am not Napoleon when in one of my less
clear moments I “remember” losing a battle at Waterloo. In order for
a specific memory to coconstitute personal identity, then, that
memory is required to be veridical. Establishing whether or not a memory
is veridical means, however, establishing whether or not the person
who remembers really witnessed the remembered event. And that
just means establishing whether the witness and the rememberer are
identical. Thus, personal identity seems to be presupposed by
veridical memory so that, on pain of vicious circularity, the latter cannot be
a criterion for the former.

The circularity objection (in the general form in which I will
discuss it here) says that the definition of my having a veridical
memory is:


(1) I am presently in a state as if remembering having a specific
experience.

(2) The content of that state is qualitatively identical or very similar to
a past experience.

(3) I had that experience.


If this definition is correct, memory cannot be said, informatively,
to coconstitute personal identity, since the identity between
the person stage who had the original experience and the person
stage who remembers having that experience is already presupposed
by (3).

Shoemaker (op. cit.) was the first to come up with a substantial
solution to the circularity problem. In order to introduce his
solution, let me discuss a peculiarity of the above definition.

Suppose I had experience E (I saw a bear, say) but completely
forgot about it. Later, I am told by a friend about an event that is
identical to the one that caused me to have E. My friend did not
witness any event of the sort; her story is made up. By telling me her
fiction, she unintentionally supplied me with every conceivable detail
of an event I witnessed but forgot about. The story triggers my
imagination, endowing me with vivid images of the event. I slowly
begin to identify with these images and start to internalize them.
“Mistakenly,” I begin to fit them into my biography. The distinction
between these images and my real memories begins to fade until I am
finally under the impression that I really remember having had
experience E.

In the above case, (1), (2), and (3) are satisfied. Yet we are not
normally inclined to call this a case of experience memory. The fact
that I had the experience and the “recollection” is completely
coincidental. The mechanism by means of which I acquired knowledge of
my past is absolutely unreliable, to put it mildly. For by the same
mechanism I could have had a “memory” of an event I never
witnessed; in fact, that would have been much more likely.

The reason why the above mechanism is unreliable is that my state
of “recollection” is in no way dependent upon my actually having had
the “recollected” experience. Our concept of experience memory
assigns an explanatory role to an original experience in the
generating of a state of recollection thereof. In other words, in order for me
really to remember E, E should have been causally active in the
production of my state of recollection. Thus it seems we must add a
fourth requirement to the definition of veridical memory:


(4) My memory is causally dependent, in the right kind of way, on the past
experience that is qualitatively identical or similar to my memory.


Importantly, (4) does not mention ‘I’. Shoemaker’s insight was that
(4) safeguards what (3) was called in to secure. That is, normally the
fact that I had experience E secures my veridically remembering E,
because in normal circumstances this guarantees a causal-explanatory
connection between E and its recollection. In other words, (3)
normally secures (4), while of these two, (4) really matters. The point of
the above example is that in abnormal circumstances (3) does not
secure (4). Shoemaker’s (and Derek Parfit’s (op. cit., pp. 219-22))
proposal, therefore, was simply to drop (3) in favor of (4). (1), (2),
and (4) yield a satisfactory definition of veridical memory which does
not mention ‘I’ and hence does not presuppose personal identity.

One consequence of this solution is that, if there are ways of
causally connecting an experience had by person a to a recollection
of that experience had by person b, then b can be said to veridically
“remember” a’s experience. Since this is not in line with our ordinary
use of the term ‘memory’, Shoemaker has introduced the notion of
Q(uasi)-memory. Person b Q-remembers a’s experience. The crucial
theoretical move here is that ordinary memory can be considered a
subspecies of Q-memory, that is, the subclass of causally connected
experiences and recollections in which both the experience and
recollection are instantiated in the (numerically) same brain. Since
Q-memory does not require reference to persons in its definition
((1), (2), and (4)), neither does ordinary memory.


II. THE TROUBLE WITH Q-MEMORY

Q-memory is impossible to square with a holism of the mental, not
just as a controversial functionalist doctrine, but also as an
uncontested part of common-sense psychology, or so I argue. A definition
of memory in terms of (1), (2), and (4) needs to be supplemented
with a principle that secures a level of coherence of a memory with its
psychological context in virtue of which it maintains its contents and
epistemic entitlements and hence its status as a memory.

Parfit’s example of Q-memory is that of Paul and Jane. Paul has
been to Venice and remembers seeing Palladio’s San Giorgio. Jane
has never been to Venice. But after a neurological operation in which
some of Paul’s brain traces are instilled in her brain, she finds herself
endowed with what seems like a recollection of looking at San
Giorgio from the point of view Paul actually saw it.

Examples such as these are highly artificial in their simplicity.⁶
They concentrate on experiences and recollections that are
describable more or less in abstraction from other mental contents. By doing
so they obscure serious difficulties with Q-remembering other
persons’ experiences that stem from the fact that most experiences and
memories require an often rather elaborate context to make sense.
Let me give an example to illustrate this point.

Suppose you have just taken your children to school. You have
clear memories of walking the familiar (to you) route in your
neighborhood to their school. You remember what your children looked
like this day. You remember thinking things such as “It’s time to buy
a new coat for Mary,” and so on. And you remember the familiar sight
of the school building. Now suppose these memories are instilled, by
surgical means, say, in my brain. I do not know your children and
have never visited your neighborhood. What would my Q-memories
of your experiences be like?

I might have a picture in my head of your children walking the
route they walked and entering their school. But the route and the
school will not evoke feelings of familiarity, and the children will
seem strangers. Marya Schechtman argues that, if I do not change my
beliefs about the neighborhood and the children not being mine, my
Q-recollection would be significantly different from yours, so that the
similarity requirement between experience and recollection ((1) and
(2)) is violated.⁷ If, on the other hand, I somehow get the belief
instilled in me that the imagined neighborhood and the children are
mine, the memory might resemble the original experience, but only
at the cost of endowing me with a false belief about myself. I claim
that the second of these possibilities is in fact no option, and that the
first presents the Q-memory theorist with a problem more serious
than the absence of qualitative similarity between experience and
recollection might suggest.


6 See also Schechtman, “Personhood and Personal Identity,” this JOURNAL, LXXXVII,
2 (February 1990): 71-92, and “The Same and the Same: Two Views of Psychological
Continuity,” in American Philosophical Quarterly, XXXI, 3 (1994): 199-212.

7 “Personhood and Personal Identity,” pp. 79-86.

Let us start with the second possibility. Artificially endowing a
person with the required background beliefs meets with a problem
similar to the one it is supposed to solve. Just as we cannot endow
someone with an isolated memory and expect its entire content to fit
into any new psychological context, so we cannot endow a person
with a belief just like that.

Suppose that along with the Q-memory I get the belief instilled in
me that the children I Q-remember taking to school are my children.
The fact that this single belief contradicts an immense number of
memories and beliefs I have about myself would be more than
sufficient for me to reject it as a delusion.⁸ We might try to remedy
this by endowing me with a large number of other background
beliefs, and possibly even erase a few of my actual beliefs. But beliefs
have a mind-to-world direction of fit. Hence my environment (my
family, work, neighborhood, in short “my life”) will put rather severe
constraints on what is rationally admissible for me to believe. It would
be highly coincidental, then, if the background beliefs I am required
to have did not clash with my environment (think only of the belief
that the school I Q-remember is near to where I live). Similarly, it
would be extremely unlikely that the actual beliefs of mine that
should be erased because they contradict either the belief that the
Q-remembered children are mine or the background beliefs that
belief requires can indeed all be erased without severely impairing my
rationality.

In many (quite possibly all) Q-memory cases, then, endowing a
person with a Q-memory of someone else’s experience will result in
a clash between either this Q-memory or its required background
beliefs with part of the new psychological context. The problem with
Schechtman’s second possibility is not only that the Q-“rememberer”
will be endowed with false beliefs. The problem is that the
incoherence of a Q-memory with its psychological context cannot be
remedied by adding background beliefs; there will always be incoherence.
And that leaves us with the first possibility: the Q-recollection (with or
without added background beliefs) is more than likely to differ
substantially from the original experience.


8 See, for example, Daniel Dennett, “Mechanism and Responsibility,” in Gary
Watson, ed., Free Will (New York: Oxford, 1982), pp. 168-69.

If all a clash between a Q-memory and its new psychological
context results in, however, is the fact that the Q-memory will not be
qualitatively similar to the original experience with which it is causally
connected, this need not yet deter the Q-memory theorist. The
qualitative similarity of the complete experience and recollection is not
sacrosanct.⁹ It may easily be admitted that only part of a
Q-recollection resembles the original experience due to the difference in
psychological context of the two mental states. In fact, if what seems
like a memory does not entirely fit its psychological context, so to
speak, this may be taken as an indication that one is Q-remembering
someone else’s experience, provided one knows such a thing to be
possible.¹⁰

But the displacement of a Q-memory in an “alien” psychological
context also results in the loss of content and epistemic merits of a
memory, or even the impossibility of there being something that
resembles a memory state at all, depending on what theory of mind
one adopts. Let me elaborate on that.

The worst-case scenario for the Q-memory theorist follows from the
kind of holism with regard to mental content which is pervasive in
most versions of what may well count as the best established theory of
mind, functionalism, but also in such theories as interpretationism or
anomalous monism.¹¹ According to this holism, inferential and
evidential interconnections between propositional attitudes (among
which are the beliefs to which memories give rise) are coconstitutive
of their contents. If such holism were true, many cases of Q-memory
would not just be cases in which the Q-memory differs qualitatively
from the experience that caused it; rather, they would be cases in
which there would be no similarity in content whatsoever between the
newly inserted brain state and its causal origin. If inferential and
evidential connections constitute content, then the clash between a
Q-memory and its newly acquired psychological context will result in
the Q-memory’s having a nonsensical content or, rather, no content
at all.


9 It need not count against a theory of memory to drop the requirement that an
experience memory should resemble an original experience in its entirety. In fact,
the requirement is too strong anyway. If my recollection of seeing a bear last
summer vacation was accompanied by the thought I had then (“I am seeing a bear
now”), I am not remembering that event but having an illusion. If, on the other
hand, my memory is accompanied by the thought “I saw this bear so many months
ago,” I am having a memory of that event even though the contents of my mind do
not completely resemble those at the time when I saw the bear. See section IV.

10 See also John McDowell, “Reductionism and the First Person,” in Jonathan
Dancy, ed., Reading Parfit (Cambridge: Blackwell, 1997), pp. 240-41.

11 On “interpretationism,” see, for example, Dennett, The Intentional Stance
(Cambridge: MIT, 1987), and on “anomalous monism,” see, for example, Donald
Davidson, “Mental Events,” in Essays on Actions and Events (New York: Oxford, 1980),
pp. 207-25.

Moreover, to insert a Q-memory in a brain will be to tamper with
the structure of its causal network. And since, according to the
theories under consideration, propositional attitudes are global states
of such a network, to tamper with this network is to distort not just the
content of the inserted memory but also the contents of a whole host
of other global states. If constitutive holism (as we may call it) is true,
individual memories are not linked to their original experience by a
separate causal connection. Rather, the whole frame of mind by
which a recollection is coconstituted is linked to the whole frame of
mind by which the original experience was coconstituted, without it
being possible to single out the causal connections that link
individual mental states. If constitutive holism is true, Q-memory is a myth.

But constitutive holism is not uncontroversial.¹² Theories of mind
such as substance dualism, type-identity theory, or (versions of)
computationalism¹³ are more inclined toward some kind of psychological
atomism. Thus it seems they may allow for cases of Q-memory.

There is a much less strict mental holism, however, that is not part
of a theory of mental content but rather part of our common-sense
notion of how mental states interrelate; it is part of the explanandum
rather than explanans, so to speak. This is the holism according to
which a person can hold only so many incompatible beliefs before
she can no longer be said to possess the rationality that is required for
the ascription of beliefs and desires to be explanatorily profitable,
even though such lack of rationality would not make mental-state
ascriptions senseless, as they would be in the case of constitutive
holism. It is the common-sense holist who says that I can believe that
X only if I have the background knowledge required to make sense of
‘X’ and to tell under what circumstances ‘X’ is true or false. Such
holism is in no way precluded by dualism, type-identity theory, or
computationalism.


12 See, for example, Jerry Fodor and Ernest Lepore, Holism: A Shopper’s Guide
(Cambridge: Blackwell, 1992).

13 On “type identity,” see, for example, J. J. C. Smart, “Sensations and Brain
Processes,” in V. C. Chappell, ed., Philosophy of Mind (Englewood Cliffs: Prentice-Hall,
1962), pp. 160-72; on “computationalism,” see, for example, Fodor, A Theory of
Content and Other Essays (Cambridge: MIT, 1990).

Common-sense holism does not oppose Q-memory the way
constitutive holism does. But it does undermine the (epistemic) potential
of Q-memory. Just as a Q-memory is likely to clash with part of its new
psychological context, so it is likely that the new psychological
context does not contain all the resources that are required for the
Q-memory to retain its full content and epistemic entitlements. For
instance, the richness of your experience in taking your children to
school is lost on me. Moreover, due to the absence of relevant
background knowledge, the information to which this Q-memory
gives me access is very limited compared to your own recollection of
the experience. For example, while you remember Jim and Mary
entering their regular school (with its history in their lives, and so
on), all I Q-remember is some kids entering a school-like building
they might as well be seeing for the first time. Similarly, you may
remember your daughter to be uncharacteristically sad that day. All
I Q-remember is that there was a sad girl.

So, even if I recognize my Q-memory of your experience as a
Q-memory and do not discard it as an illusion, it will not have all the
epistemic entitlements it has to you. Importantly, this is due simply to
the new psychological context; no causal connection can make up for
the loss in epistemic value. To this, the Q-memory theorist may reply
that, even if Q-memory is unlikely to carry as much information as
ordinary memory, this does not mean that it carries no information
at all. While Q-remembering your experience of walking your
children to school, I may conclude that there were children who looked
a certain way, who were walking toward a building that looked like a
school while being accompanied by someone whose perspective I
relive, and so on. Surely, my drawing these conclusions requires me
to know that Q-memory is possible, but that is not specifically
problematic.

Although correct, this reply misses a crucial point. The
information carried by the Q-memory which is not lost on me is accessible in
virtue of the contingent fact that my mental make-up is such that it
allows me to access it. There is nothing in the causal connection
between your experience and my Q-recollection that secures there
being an appropriate context to access the information that is
transmitted by it. If a Q-memory of your experience were instantiated in
the brain of a member of some exotic tribe that has not had
conscious contact with Western civilization, virtually all information
contained in the memory would be lost on that person. The person
would simply not know what she is Q-remembering, even if she knew
that she is Q-remembering. (If this example is not convincing, we can
always bring out the Martians, but I trust I shall not have to.)
It follows from common-sense holism, then, that the contents and
epistemic merits of a (Q-)memory are just as much dependent upon
the memory’s psychological context as they are on the causes that
produce the brain state upon which they supervene. Parfit’s
supposition (that only the causal origins of a memory count) presupposes
a psychological atomism that can be sustained only by focusing
exclusively on the phenomenal aspects of memories, as is done in the
Paul and Jane case. But such atomism cannot be sustained when the
conceptual side of memory is taken into account. Jane, for instance,
can Q-remember Paul’s experience as an experience of Palladio’s
San Giorgio in Venice only when she knows who Palladio was, what
and where Venice is, and so on. And because the very contents of
memories depend on their psychological context, so do their
epistemic entitlements.

To conclude, on common-sense holism, (1), (2), and (4) do not
make for a complete definition of memory. Apart from a causal
connection between an experience and a recollection, an
appropriate context for the recollection needs to be secured which allows the
memory to retain some or most of its content and hence its epistemic
merits. Part of such a context might be expected to exist in many
cases of Q-memory, simply because, in general, people share a large
portion of their background beliefs, if only because they inhabit the
same world. But this is a contingent fact that is not in itself secured
by the presence of the required causal connection between
experience and Q-recollection.

Constitutive holism yields a similar, though stronger conclusion.
On this theory, a causal connection between an individual experience
and an individual recollection is also not sufficient for memory. Since
neither the experience nor the recollection can be individuated in
abstraction from its respective psychological context, a definition of
experience memory requires a principle that secures the presence of
the appropriate recollection context, given what the experience
context was like.

What both kinds of holism imply is that a definition of memory in
terms of (1), (2), and (4) needs to be supplemented by some kind of
context-continuity requirement: a principle that explains and secures
(rather than presupposes) the presence of a psychological
context of a recollection which is related to the context of the
original experience such that it allows the recollection to retain a
sufficient portion of the original experience’s content and hence
have sufficiently enough epistemic merit to count as memory. Even
without an answer to the difficult question of what counts as
“sufficient” and “sufficiently enough,” the incompleteness of Q-memory is
plain.

In normal circumstances, the identity (or spatiotemporal
continuity) of a body and brain secures (1), (2), (4) and the
context-continuity requirement that cannot be captured by these three. This
fuels the suspicion that memory defined in terms of (1), (2), and (4)
in fact presupposes identity of the brain, or of the brain and body.
Proponents of a bodily criterion of personal identity, but especially
proponents of a brain criterion,¹⁴ will find themselves in a rather
good position to claim that, even on a causal definition of memory,
memory does presuppose personal identity.

In what follows, I argue that a psychological criterion of personal
identity need not be rejected on the basis of this circularity objection.
What memory presupposes, and what fills in the details of the
context-continuity requirement, should be spelled out in terms of a type
of psychological continuity rather than in terms of the unity of the
brain or brain and body. This may seem to be an odd claim, given
that psychological continuity itself has always been analyzed at least
partly in terms of memory. I claim, however, that psychological
continuity in the required sense is not constituted by memory;
memory is an epistemic, rather than an ontological criterion for it.


III. TWO KINDS OF PSYCHOLOGICAL CONTINUITY

The circularity objection (in its contemporary guise) says that
psychological continuity cannot analyze personal identity, since (A)
psychological continuity consists of overlapping chains of
psychological connectedness (in a sense to be explained below), and (B) the
most important kind of psychological connectedness, memory,
presupposes personal identity. The Q-memory solution to this objection
attempts to deny (B). I argue against (A) that psychological
continuity cannot be analyzed in terms of psychological connectedness.

Psychological continuity as I characterize it does not presuppose
personal identity. Hence it can possibly analyze it, without
circularity. Memory presupposes psychological continuity, I claim, for one
thing because this provides the “context continuity” memory
requires. Since psychological continuity might analyze personal
identity, memory presupposes personal identity. But there is no harm
done by that fact. For memory itself should be understood as a mere
epistemic criterion of personal identity. That, at least, is what I shall
argue in an attempt to do justice to Butler’s original point, to the
contemporary circularity objection, and to the neo-Lockean insight
that personal identity is constituted by psychological continuity.


14 On “bodily criterion” proponents, see, for example, Bernard Williams,
“Personal Identity and Individuation,” in Proceedings of the Aristotelian Society, LVII (1957):
229-52, and David Wiggins, Sameness and Substance (New York: Blackwell, 1980); on
“brain criterion” proponents, see, for example, Thomas Nagel, The View from
Nowhere (New York: Oxford, 1986).

But let me start by indicating how psychological connectedness (for
example, memory) is supposed to constitute psychological continuity
and what is wrong with this supposition.

Detailed analyses of the concept of psychological continuity in the
current debate are rare. Parfit seems to articulate a near consensus
on this issue when he defines psychological continuity in terms of
overlapping chains of “strong connectedness” (op. cit., pp. 204-09).
Some elucidation may be required here.

Psychological connectedness denotes relations between mental
states, such as those between experiences and recollections thereof,
or between intentions and their being carried out, the continuous
holding of beliefs and preferences, and the stable persistence of
character traits. It is easy to see why psychological continuity theorists
emphasize such relations of qualitative similarity or identity. The
debate over personal identity is a debate over what makes two differently
time-indexed person-stages stages of the same person. When
defining a psychological criterion of personal identity, what is more
natural than to emphasize what remains the same, psychologically
speaking, throughout large parts of a person’s life?

Connectedness cannot analyze personal identity, however, for
while identity is a transitive relation, connectedness obviously is not.
Take memory. I may now remember my tenth birthday, and on my
seventieth birthday I may remember writing this paper. I may not,
however, remember my tenth birthday on my seventieth. Thus, by the
criterion of memory, my tenth birthday-self and my seventieth
birthday-self are unconnected, even though my present self is connected
to both. In order to remedy this deficit, Parfit introduced the notion
of overlapping chains of strong connectedness.

Two person-stages are stipulated to be strongly connected if there
are at least half the number of direct psychological connections
between them which hold in the lives of every normal person over
one day. The concept of psychological continuity, now, is defined as
the holding of overlapping chains of strong connectedness. If chains
of strongly connected psychological states continue to overlap in time
(the way fibers in a rope overlap in space), there will be psychological
continuity. Thus, if enough direct psychological connections (memories,
intentions, persistent beliefs and desires, and the like) overlap
in my life between my tenth and my seventieth birthday, these two
person-stages will be continuous, even if no direct psychological
connection holds between them.
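
The logical structure of this definition can be made explicit in a small sketch. The Python fragment below is an illustration only and is not drawn from Parfit's text: the stage labels, the particular set of strong links, and the function names are all assumptions chosen for the example. It treats strong connectedness as a symmetric but non-transitive relation between person-stages, and continuity as the existence of a chain of such links.

```python
from collections import deque

# Hypothetical person-stages, labeled by age (an assumption for illustration).
STAGES = [10, 40, 70]

# Assumed pairs of strongly connected stages.
# The pair (10, 70) is deliberately absent: no direct connection holds there.
STRONG_LINKS = {(10, 40), (40, 70)}


def strongly_connected(a, b):
    """Strong connectedness: symmetric, but not transitive."""
    return (a, b) in STRONG_LINKS or (b, a) in STRONG_LINKS


def continuous(a, b):
    """Continuity: some chain of strongly connected stages leads from a to b."""
    seen = {a}
    queue = deque([a])
    while queue:
        current = queue.popleft()
        if current == b:
            return True
        for nxt in STAGES:
            if nxt not in seen and strongly_connected(current, nxt):
                seen.add(nxt)
                queue.append(nxt)
    return False
```

On these assumptions, the stages 10 and 70 are not strongly connected, yet they count as continuous because each is strongly connected to stage 40. That is the sense in which continuity, unlike connectedness, is transitive.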

Characteristic of the Parfitian definition is that psychological
continuity is exhaustively defined in terms of psychological connectedness.
The definition is hardly ever challenged explicitly. But as
an exhaustive definition, it just will not do. It is absurd to claim
that the multitude of mental states comprising a person’s mental life
is held together merely by relations of qualitative similarity and
co-occurrence (in the same mind).

Consider Shoemaker: “psychological continuity is constituted no
more by [psychological connectedness] than it is by evolution and
execution of plans of action, by deliberation and reasoning, and by
countless other mental processes” (op. cit. (with Swinburne), pp.
95-96). His point is to include psychological activities such as
deliberating and thinking in the class of continuity-constituting
connections. I shall not be concerned merely with thinking and deliberating
in what follows. Rather, I shall elaborate on an implication of
Shoemaker’s point that continuity of our mental lives not only consists in
relations of qualitative similarity between mental states (for example,
memory), but also and importantly in relations between consecutive
qualitatively nonsimilar states.

Here, I want to concentrate on some of the conditions under which
the succession of qualitatively distinct mental states yields a kind of
psychological continuity that cannot be defined or analyzed in terms
of Parfitian connectedness. As a first abstract characterization of the
continuity I am after, let me offer the following: consecutive
qualitatively distinct mental states can be continuous in virtue of the fact that
their order of appearance crucially determines their full content and
the intelligibility of their occurrence. Differently put, a sequence of
mental states can be continuous when the contents and intelligibility
of later states in the sequence are codetermined by a diachronic
context consisting of the earlier stages of the sequence. Let me put
some flesh on these abstract bones by means of some examples.

Every stage in a thought process is usually intelligible only given its
preceding stages. To use a banal case: after wondering what the
product of 3 and 4 is, I realize it is 12. My thought “12” can only be
understood as the answer to the preceding question in virtue of its
place in the overall sequence of thoughts. Likewise, the occurrence
of “12” would be unintelligible for me if I were not aware of its place
in the thought sequence in which it occurs. It is the order in the
sequence, not my being aware of it, that connects the question to the
answer in a way that makes them continuous in the sense I am after.

Thought processes and the like reflect what goes on “in” our
minds. There are, however, also important examples of sequences of
mental states reflecting what goes on “outside” our minds and bodies
(this is a divergence from Shoemaker, one that will turn out to be
important). Take another example: the contents of your mind when
you are having a conversation with a friend. If a sample of the
contents of your mind after half an hour of conversation were
transplanted to someone else’s brain (for example, your hearing your
friend say “I know, but does Julie feel the same about that?”), that
“receiver” would not be able to grasp the full content of that state
(even if she knew its causal origins) and would be baffled about its
occurrence. You, on the other hand, are in a position to understand
perfectly well what is being said and why, because your representation
of the utterance occurs at a specific time in a sequence of
representations “depicting” the whole of your conversation.

Another example of a mental sequence reflecting external states of
affairs is the consecutive occurrence of states representing sensory
perceptions. Sequences of perceptual contents cohere in virtue of
the intelligibility of the world we perceive. Each perception at a given
moment is continuous with preceding and subsequent ones through
the total intelligible “picture” of the surroundings of the perceiving
body they draw. Each movement of the body evokes a change of
perception in accordance with the new position the body acquires.
Our thorough acquaintance with the basic laws of the
three-dimensional world of medium-sized objects (including our body)
allows us to retrieve from these consecutive impressions two
interlocking stories that are already present in them: one about the body’s
movements through space, and one about the perceptual features of
the body’s consecutive surroundings. What constitutes the kind of
continuity I am after is the fact that the sequence of contents coheres
such that we can (though need not) retrieve these stories.

It is a feature of this kind of continuity that what may seem to be
two qualitatively identical mental contents on Parfit’s atomistic,
Humean bundle-theory of continuity, may well turn out to be different
contents once the relevance of their diachronic mental contexts is
recognized. Take, for example, two qualitatively identical (for the
sake of the argument) visual perceptions representing the Eiffel
tower, had by two distinct persons. One perception reflects a movie
screen in London on which a picture of Paris is being projected, the
other one reflects the Parisian construction itself. The full contents
of these representations differ, even if their visual/phenomenal
aspects are identical. The content of the impression of the person in
London is that she is “looking at a picture projected in a theater in
London.” This content is not derived from the impression alone. It
is derived from the impression’s being embedded in a context—for
example, walking through London, entering a cinema, watching a
movie that contains a picture of the Eiffel tower. Likewise, an
impression caused directly by the real Eiffel tower is embedded in a
context—for example, of visiting Paris as a tourist, wandering along
the Seine, and so on. Each of the two contents is codefined, so to
speak, by its diachronic context.

In order to have a label by means of which I can refer to these
progressions of psychological contents, I shall use the term ‘narrative
continuity’15 (N-continuity for short), with a number of qualifications.
‘Narrativity’ is not supposed to introduce teleology or constructivism
into the notion of psychological continuity. Nor is the term intended
to suggest that psychological continuity involves a plot or a “point”;
the stories involved are usually of a mere “and then..., and then...”
kind. The aspect of narrativity that nevertheless renders the term apt
is the one Alasdair MacIntyre16 drew attention to: “a perception” or
“a thought”—for instance—taken in isolation is an abstraction from
a whole perceptual sequence or a complete process of thought. Just
as a particular event in a story acquires its full sense only as part of the
whole story, individual thoughts acquire their full meaning only as
part of a process of deliberation, and individual sense perceptions
acquire their full sense only as part of a sequence of perceptions
portraying a body’s movements through space-time. It is the whole of
such a sequence that endows its “parts”—which are abstractions
rather than constituents—with their full meaning.

The notion of N-continuity as described so far is still relatively
vague. It is extremely tricky to try and offer a formal definition of it,
however.17 Instead, let me defend four claims about N-continuity
which will highlight those aspects of it which will be relevant to the
overall argument.


(1) N-continuity does not require or presuppose memory.
(2) N-continuity is well capable of capturing the transitivity that is a
salient feature of identity by providing a life-long psychological
narrative. In normal circumstances, in the actual world, this is due
to the fact that we are embodied beings.

(3) Nevertheless, N-continuity is not bodily continuity in disguise.

(4) Although N-continuity is more than likely to be causally
underpinned, it cannot be analyzed in terms of causality.


15 No similarity with Paul Ricoeur’s narrative theory of personal identity is
intended; see Oneself as Another (Chicago: University Press, 1992).

16 After Virtue (Notre Dame: University Press, 1981), pp. 217ff.

17 See Igor Douven, “Marc Slors on Personal Identity,” Philosophical Explorations, II,
2 (1999): 143-49, and my “A Reply to Igor Douven,” Philosophical Explorations, II, 2
(1999): 150-52. For a more elaborate treatment of N-continuity, see my The
Diachronic Mind: An Essay on Personal Identity, Psychological Continuity and the Mind-Body
Problem (Boston: Kluwer, 2001), chapter 3.


If these claims can be established, N-continuity can be considered a
kind of psychological continuity which is distinct from Parfitian
continuity, notably due to the fact that it does not involve memory, but
which is potentially just as much capable of analyzing personal
identity.

(1) N-continuity does not presuppose memory. I have been talking so far
about the relevance of the diachronic context of a mental state in
determining this state’s content and intelligibility, without
distinguishing between the past mental states that make up this context
and their representations that (usually) co-occur with the mental
state whose content they codetermine. The distinction is important,
however, for it may seem as if N-continuity involves memory, and the
distinction helps to explain why this is not the case. To use some
labels, let me speak of the diachronic context of a mental state when
I mean to refer to earlier mental states in a sequence, and speak of
cotemporal context when I mean to refer to the set of co-occurring
mental states that may or may not contain representations of (part
of) the diachronic context.

The idea that N-continuity presupposes memory is the idea that for
the full content and intelligibility of a given mental state m to be
codetermined by that state’s diachronic context, this context (or at
least the relevant bits thereof) ought to be represented in the
cotemporal context of m. Since the representation of earlier mental states
in a present frame of mind just is memory, if representation of the
diachronic context in the cotemporal context is required for there to
be N-continuity of m with its mental predecessors, this would mean
that N-continuity requires memory.

In order to keep N-continuity and memory apart, it is crucial to
distinguish sharply between N-continuity, on the one hand, and
awareness thereof, on the other. While awareness of N-continuity of
m with its predecessors does require the diachronic context of m to be
represented in its cotemporal context, N-continuity itself does not
require this. That is, the representation of earlier mental states is
required for a “subject”18 to be aware of, or have access to, the full
content of her mental states. But the full content of a mental state in
the sense relevant to N-continuity is not determined by her actually
accessing it; it is codetermined by her being able to do so, that is, by
the occurrence of previous mental states, not by these previous states’
later representation.

18 The term ‘subject’ is not meant to refer to a substantial self that exists
independently of the mental states it ‘has’. Rather, it refers to a function of these
states themselves, such as the having of a consistent set of second-order states. The
subject, then, is something like a center of narrative gravity, to use Dennett’s term.

An example might help here. Take the above case of the two
perceptual representations of the Eiffel tower, one caused by the
tower itself, and one caused by a picture on a movie screen. Now
suppose that the two persons in these perceptual states suffer from
amnesia. Their long-term and short-term (episodic and iconic)
memory are impaired (their semantic memory can be left intact). One
perception is that of the real Eiffel tower in Paris, the other is of a
picture on a movie screen in London. Given their amnesia, however,
neither of the two persons is capable of accessing these full contents.
From their perspectives, both states have identical contents. But that
is merely because they both have access only to the visual/phenomenal
part of their perceptual states. Given their amnesia, they do not
have access to the (N-continuous) stream of mental states which leads
up to these perceptions and endows them with contents richer than
their phenomenology can be credited for. But that does not make
these perceptions less N-continuous with their preceding mental
states: they still have a content that is there to be potentially discerned
and put to use as soon as the amnesia is over, a content that is
determined by their diachronic psychological context.

One might object to the term ‘content’ when a person need not be
aware of it. But such an objection would not refute the dissociation
of N-continuity and memory. Accommodating it can be done in
either of two ways. One is to redefine N-continuity in counterfactual
terms: one mental state m is N-continuous with a series of previous
mental states if and only if, when m’s cotemporal psychological
context contains representations of these previous states, m’s full content
becomes available to the subject. Alternatively, the notion of
‘content’ itself can be cast in counterfactual terms. A state’s full content,
then, is determined by what would be available to the subject should
the diachronic context of this state be represented in its cotemporal
context. The state would have this content, even when the diachronic
context is not actually cotemporally represented. This position would
be analogous with externalism about mental content. Here, too, a
state can have its content determined by things that are not
necessarily available to the subject.

Either way, a given mental state can be N-continuous regardless of
whether its cotemporal context actually contains representations of
previous mental states. A mental state can thus be N-continuous with
its predecessors even though a lack of memory precludes awareness
of this. The fact that N-continuity and memory are correlated more
often than not is just as well, though, since it allows us to access the
full contents of our mental states. But the correlation is contingent—
N-continuity itself does not require memory.

(2) The N-continuity-based unity of our psychological lives. In labeling
the mental processes I am concerned with here cases of narrative
continuity, I am making an as yet unargued for, but nevertheless
important, assumption. The term ‘continuity’ is meant to refer to a
relation that is characterized by the transitivity that is a salient feature
of identity but not of connectedness. For narrative sequences to be
called “continuity,” it must be shown that such sequences constitute
a life-long transitive relation other than merely by overlapping. In
other words, there has to be a sense in which the mental life of one
person consists of one narrative sequence.

There is a sense in which my mental life is one large narrative,
despite the fact that my thoughts may jump from topic to unrelated
topic, despite the fact that feelings can sometimes arise “out of the
blue,” despite the fact that I lose consciousness when I sleep,
despite—to summarize—the fact that our consciousness is patchy like a
quilt and (in Daniel Dennett’s term) “gappy.” Let me first elaborate
on how gappiness and patchiness do not stand in the way of there
being one narrative organization of the mental lives of actual persons,
and then turn to the question whether such an organization can
survive hypothetical scenarios such as brain transplants.

With regard to our actual situation, what I have in mind is this:
despite the fact that our psychological lives are gappy and patchy,
there is a basic narrative that is neither patchy nor gappy. This basic
narrative is represented by our consecutive perceptual contents. No
doubt other types of content also play a unifying role, but I
concentrate on perceptual contents because their unifying role is, I think,
more transparent than that of other types.

The two stories told by the successive impressions of our senses are,
as I briefly indicated, complementary: the story of a body’s
movements through space and time and the story of the particular
perceptible features of its surroundings at consecutive times. Both stories
interlock into one story about an objective continuant’s successive
whereabouts. Successive perceptions are the subjective counterpart
of the objective event of one body moving through space and
perceiving its surroundings.

Successive perceptions acquire narrative coherence in virtue of the
fact that they are caused by one body’s movements through a stable
(not static) physical world. We are acquainted with the character and
proceedings of this world so that we are able to appreciate this
coherence. This is largely in agreement with John McDowell:
“continuous ‘consciousness’ is intelligible (even ‘from within’) only as a
subjective angle on something that has more to it than the subjective
angle reveals: namely, the career of an objective continuant” (op.
cit., p. 233).

As it stands, though, McDowell’s claim is stronger than what I am
after. For one thing, I do not want to claim that psychological
continuity as a whole hinges on the continuity of an objective
continuant or body. I only claim that N-continuity of perceptual contents
does. The narrative unity that perceptual contents provide, I shall
claim, is necessary but not sufficient for full-blown psychological
continuity. Furthermore, I do not take it as a conceptual truth that
the diachronic unity of mind that constitutes personal identity over
time—according to a psychological criterion—requires a perceptual
narrative for which bodily continuity, in turn, is a prerequisite. It may
well be an empirical fact, though, that the diachronic unity of mind
of all actual persons is secured for a large part through bodily
continuity.

With these qualifications in mind, and bearing in mind that we are
discussing actually existing persons, the crucial unifying role of
perceptual contents stems from the fact that, in Peter Strawson’s19 words,
“a temporally extended series of experiences should have a certain
character of connectedness and unity, secured to it by concepts of the
objective. That experience should be experience of a unified
objective world at least makes room for the idea of one subjective or
experiential route through the world, traced out by one series of
experiences which together yield one unified experience of the
world—a potential autobiography” (ibid., p. 193).

Perceptual contents accompany virtually all of our conscious life.
Given that their source, an intelligible world, is a coherent whole—
space-time and causally related objects—their sequences cannot be
patchy. Finding myself at place a, I need not expect to have
perceptions of place b 100 miles away within the next ten minutes. We
cannot be here one moment and a hundred miles from here the
next. If we go from a to b, we shall pass through all intermediate
places separating a and b according to the route we take, thus
producing a sequence of perceptions narrating the story of this route
(the position of our bodies at different places, the position of our
eyes, and so on).20 The story simply cannot just stop to make room
for a different one. Perceptual continuity, then, cannot be patchy.
(And if it appears so, we know we are the victim of an illusion, the
uncovering of which restores the continuous story.)

19 The Bounds of Sense (London: Methuen, 1966); see also Quassim Cassam, “Kant
and Reductionism,” The Review of Metaphysics, XLIII (1989): 72-106, “Parfit on
Persons,” in Proceedings of the Aristotelian Society, XCIII (1993): 17-37, and Gareth
Evans, The Varieties of Reference, McDowell, ed. (New York: Oxford, 1982).

Perceptual continuity is gappy, though. Every now and then, mostly
when we go to sleep, we lose consciousness. But this does not at all
imply that the narratives that are conveyed by or implicitly present in
our perceptions are gappy. Usually, we wake up where we went to
sleep. The story of our body’s movements, then, continues where it
was broken off. It is not unimaginable, though, that we wake up
somewhere else. In such a case, it is misleading to say that the
perceptual narrative is gappy or incomprehensible. The concept of
our body as a physical object—which may either be the product of
induction from earlier perceptions, or of evolutionary hardwiring—
regulates the story told by our consecutive perceptions, whether we
are aware of it or not. Thus, the diachronic context of one’s
perceptions when waking up somewhere else than where one went to sleep,
will endow those perceptions with a content that, in linguistic terms,
reads “the perceptions while waking up after the body that issues
them has been moved while unconscious,” or something to that
effect. Perceptual narratives, then, are subjective counterparts of
complete objective narratives, and as such not gappy.21

N-continuous perceptual contents provide our psychological lives
with what I shall call a basic narrative. A perceptual narrative is “basic”
in the sense that it provides a lifelong narrative that unifies the
multitude of mental states that makes up a “stream of consciousness,”
while it is in itself insufficient for full-blown psychological continuity.
It is an elementary thread running through my “stream of
consciousness,” connecting all its different temporal stages. Moreover, it
provides an elementary “format” for the structure of such streams by
forming the diachronic background against which all and only the
mental states of one person occur. Such a background puts rather
severe constraints on thoughts, intentions, beliefs, and so on, thus
securing a high degree of coherence, since most nonperceptual
mental states are specifically connected with and shaped by a
particular episode of a basic narrative. Let me elaborate on that.

20 Parfit’s teletransportation thought experiment seems to contradict this. For
reasons to ignore teletransportation, see the last section of my “Two Conceptions of
Psychological Continuity,” Philosophical Explorations, I, 1 (1998): 59-78.

21 This does not preclude our misconstruing perceptual narratives or our
misrepresenting them when we remember (part of) our past narratives. The point here
is merely that any interpretation of a string of perceptions requires the supposition
that there is at least an objective story to be told which connects these perceptions
with a certain well-determined set of sense organs.

Nonperceptual mental contents such as beliefs and thoughts do
not merely co-occur with perceptual ones, as, for example, Parfit (op.
cit., pp. 250-51) would have it. Thoughts, desires, and beliefs may, for
instance, be directed at items in a person’s direct proximity: I may
attend to what my conversation partner is saying, my attention may be
with a book I am reading, I may be busy trying to find the quickest way
of getting somewhere, I may be enjoying a concert, and so on, and so
on. In such cases, it is easy to see how thoughts interrelate with the
basic narrative against the background of which they occur. The
same goes for intentions, which take into account our present
whereabouts and relate that to a future action at a future place.

But even when thoughts, beliefs, and desires do not concern a
person’s actions or direct environment, there are almost always
conceptual, evidential, or biographical ties to the subject’s basic
narrative. Conceptual ties stem, for example, from our acquaintance with
a stable physical world—including our bodies—and its basic
regularities, which is conveyed through our basic narrative. (My belief that
the Eiffel tower is in Paris implies that it is about 600 kilometers from
where I am now, that it will take me about 5 hours by train to get
there, and so on.) Evidential connections between beliefs and a basic
narrative are there simply because whenever there is perceptual
evidence for a belief, a perception will also fit in and be part of a
larger basic narrative. Biographical connections between one’s
beliefs and desires and one’s basic narrative, finally, stem from the fact
that a basic narrative, being a major constituent of a biography,
encompasses one’s experiences and hence determines one’s “food
for thought.”

Thus, a basic narrative is intertwined with most of our other
thoughts, beliefs, and desires. This secures a high degree of
synchronic and diachronic coherence between these thoughts, beliefs, and
desires themselves. Cotemporal mental states of one person are
subject to the same constraints issued by the episode of the basic
narrative with which they co-occur. This gives them a nonaccidental
“place” on the “time-scale” of the basic narrative, so to speak, so that
otherwise unrelated thoughts, for example, nevertheless do have a
rationale for their order of occurrence that confers on them a
minimal degree of N-continuity. Since a basic narrative is life-long
(starting when one’s perceptual-cognitive apparatus is well developed), it
can hence unify a psychological life into one N-continuous sequence.

(3) N-continuity is not bodily continuity in disguise. In the case of
actually existing persons, my claim is that bodily continuity provides
a basic narrative that unifies our psychological lives into one life-long
narrative. Nevertheless, N-continuity does not simply boil down to
bodily continuity supplemented with a number of psychological
subnarratives.

While a basic narrative is necessary to secure a degree of narrative
unity of a person’s psychological life, and while bodily continuity is
sufficient to provide a basic narrative given a properly functioning
perceptual and cognitive system, nothing that has been said so far
implies that bodily continuity is logically necessary for a basic
narrative. That is, bodily continuity is very likely to be the only provider of
a basic narrative, although we cannot rule out on a priori grounds
hypothetical scenarios in which persons have full-blown N-continuity
without bodily continuity.

Take a case in which the contents of a person’s brain are
transplanted into another brain and body. If bodily continuity is a
necessary prerequisite for a basic narrative, then in such a case there would
be a breach in narrative psychological continuity. We can imagine the
case, however, such that the role of the basic narrative is provided—
for a brief period anyway—by some other psychological mechanism
than perceptual representations of bodily whereabouts. For instance,
it is far from unimaginable that, if the person were informed
beforehand of the body-swapping operation, her expecting a radical change
in proprioceptual and perceptual contents after the operation would
“bridge” at least part of the narrative gap that threatens to disrupt
N-continuity. Likewise, informing the person’s peers about the
operation might result in their confirming the body swap and hence the
N-continuity of the person at issue, a confirmation that might make
the difference between confusion or even mental disintegration, on
the one hand, and mental coherence and sanity, on the other.

We can extrapolate from this example to the truly bizarre scenarios
that turn up every now and then in the debate on personal identity.
Take a scenario in which persons change bodies every week (or day,
or hour, or minute). In such a scenario, we can imagine these
persons being informed about these changes so that they are able to
bridge what to us would be significant gaps in N-continuity. But we
can also, and perhaps better, imagine this information being “wired
into” their cognitive systems, in much the same way that bodily
continuity is wired into ours as the rationale for processing successive
perceptual contents. In either case, these persons do have a basic
narrative that unifies their psychological lives. But this basic narrative
will provide a “format” for the structuring of the psychological lives of
these persons which will differ substantially from ours. (For instance,
while we can base our beliefs about the color of our hair right now on
memories of looking into the mirror weeks ago, they cannot.)

To summarize, a basic narrative is a necessary condition for
full-blown N-continuity. But since bodily continuity is only sufficient
(given a properly functioning cognitive system) for such a basic
narrative and not logically necessary, N-continuity is strictly a form of
psychological continuity. It is not bodily continuity in disguise,
despite the fact that in our actual world N-continuity does require
bodily continuity.

(4) N-continuity cannot be analyzed in terms of causality. So far, the
relations between mental states comprising an N-continuous stream
are characterized purely in terms of the contents of these states, and
not in terms of causal relations between them. This does not mean
that N-continuity is likely to exist without being “causally
underpinned.” No plausible metaphysics of the mental would allow for
the absence of causal connections—at least at the level of the
(physical) realizers of mental states.

So far so good. But now the following question arises: in Parfitian
psychological continuity, causality coanalyzes what continuity is
about (continuity is overlapping connectedness, and relations of
connectedness, such as memory, are codefined in terms of causality).
Is it the same with N-continuity? The answer to this question is
negative. The reason is that we cannot identify the causal relations
underlying N-continuity in abstraction from N-continuity. The causal
relations underlying N-continuity are far too complex and scattered
for this, infinitely more so than the allegedly straightforward causal
relations that secure psychological connectedness on the Parfitian
model. There are two reasons for this.

First, there is a diachronic mental holism to which N-continuity is
committed. The fundamental idea behind N-continuity is that mental
states at one point in time acquire their full content in virtue of being
embedded in a specific diachronic psychological context. Furthermore,
we have to do justice to a version of the synchronic mental
holism that inspired the argument against Q-memory (section II; it is
unimportant which version). The “picture” of the mind yielded by
both kinds of holism is one in which it is impossible to single out a
given causal connection as the one that connects, for example, this
experience with this recollection, or this question with this answer,
and so on. Such mental connections require what I earlier labeled
context continuity. And that means that a very large number of
causal connections are at play in what in mental terms would be a
single connection.

Second, many causal connections underlying N-continuity do not
exist wholly “within the subject.” The best example here is the
perceptual continuity involved in our basic narratives. When I turn
my head, I have consecutive perceptions of my computer, some
books, a photo on the wall, a table, and so on. These perceptions are
N-continuous. But they are not causally continuous in any sense that
resembles the causal continuity involved in Parfitian psychological
continuity. One perception does not cause the next one.

Nevertheless, causal relations do underlie the N-continuity of
perceptual contents. At consecutive times, light rays reflected from the
objects of my perceptions cause changes on the surface of my retina,
which cause changes in the optic nerve, which cause certain
perceptual states of the brain; the movement of my head causes my eyes
to be in the positions in which they are susceptible to the light rays
emitted from these objects; and so on. But the point here is that these
causal relations are scattered, so to speak. They exist partly within the
subject and partly without.

The complexity of the causal relations underlying N-continuity,
and the fact that many of them involve the outside world just as well
as what goes on inside our heads, make for a situation in which we
can only identify the causal relations involved (in principle, not in
practice) when we already have a firm grasp of the concept of
N-continuity. Without the notion of N-continuity, there is no simple
principle by means of which to demarcate the class of causal relations
underlying N-continuity (such as, for example, ‘only the causal
relations inside a person’s head count’). Hence, the causal
connections at play cannot be identified in abstraction from
N-continuity, so that the former cannot analyze the latter.


IV. MEMORY, N-CONTINUITY, AND THE
CONTEXT-CONTINUITY REQUIREMENT


In the previous section, I argued that N-continuity does not
presuppose memory, even though awareness of it does. Now I argue that
the relation between memory and N-continuity is asymmetrical: memory
does require N-continuity. The reason is that N-continuity explains
how and why the context-continuity requirement for veridical
memories can be met, whereas it could not be met by a Q-memory based
conception of psychological continuity (section II). There are at least
three interrelated ways in which N-continuity is involved in this
requirement such that it is presupposed by memory.

(1) Each (experience-)memory requires an appropriate context to
bring out its full content and epistemic entitlements; that is what the
holism-based criticism of Q-memory shows. Such a context consists,
to a large extent, of other memories, ranging from concrete
experience memories conveying the context in which one had the
experience whose recollection is at issue, to more conflated, condensed,
or summarized memories,22 or even inferred (but not really
remembered) knowledge of one's past. For instance, the cotemporal
context of my remembering my wife's last birthday dinner contains
concrete episodic memories (for example, of my cooking it), semantic
memories conveying the meaning of, for example, ‘birthday’,
‘wife’, and so on, and condensed biographical memories conveying,
for example, the fact that this birthday dinner is part of a tradition of
birthday dinners. The mechanism by which such contexts are
established, then, is memory (and sometimes inference) and not
N-continuity. But such mechanisms are insufficient to explain why and
how the context-continuity requirement is fulfilled.

What needs to be explained as well is how these mechanisms get
the appropriate “inputs,” so to speak. The context of a given memory
consists, to a large extent, of the representation of bits of a person’s
biography that codetermine the specific content of the memory. For
a memory to have the right kind of cotemporal context, then, the
remembered experience and the memory should be part of the same
“psychobiography” that contains the experiences and knowledge to
be potentially represented in the context of that memory. In a largely
atomistic conception of psychological continuity, there is no
guarantee at all that a person's psychobiography contains the right kind
of knowledge and experiences to make up a given recollection’s context
that allows us to make sense of the experience of remembering. If, by
contrast, the succession of mental states in a person's life is
considered to be structured along the lines of N-continuity, yielding the
psychological counterpart of a coherent biography, it is more than
likely that for almost each memory a person has, there is a set of past
experiences, bits of knowledge, and so on in the rememberer's
psychobiography that, when represented as the memory's cotemporal
context, allows that memory's full content to be accessible so that it


22 See, for example, Schechtman, “The Truth about Memory,” Philosophical
Psychology, VII, 1 (1994): 8-18; C. R. Barclay and P. A. DeCooke, “Ordinary
Everyday Memories: Some of the Things of Which Selves Are Made,” in U. Neisser
and E. Winograd, eds., Remembering Reconsidered: Ecological and Traditional
Approaches to the Study of Memory (New York: Cambridge, 1988), pp. 91-126;
L. W. Barsalou, “The Content and Organization of Autobiographical Memories,”
in Neisser and Winograd, eds., pp. 193-244.

retains its epistemic entitlements. The fact that the actual
representing is done via memory does not make N-continuity dispensable
or redundant in an explanation of how the context-continuity
requirement can be fulfilled.

(2) The full content of an experience-memory includes the “place”
of the remembered experience on a “subjective time-scale.” With the
passing of time, the propositional contents of the beliefs involved
in experience and memory change like this: “It is Y now” (original
experience), “It was Y a moment ago” (recollection shortly after
the experience), “It was Y a while back” (recollection long after the
experience), and so on.23 These contents have to change in order to
keep one identical memory alive. And they must change in an order
that fits the (basic) narrative of the person's temporally distancing
herself from the original event of experiencing. (A similar point can
be made about intentions.24)

For such a diachronic change of propositional contents to be
possible, the context of a memory should contain some notion of the
temporal distance between the present person-stage and the
person-stage that had the original experience. And for that it is crucial
that both the experience and the recollection are part of a psychological
narrative that is unified by a basic narrative. For only thus can we
conceive a psychological mechanism by means of which their
temporal distance is represented with a (usually not too high) degree of
accuracy: the remembered experience is “located” on the subjective
biographical time-scale of the rememberer’s N-continuous
psychological life (by relating it, for instance, to a number of
time-marking life events). The propositional contents of memories thus
require the corepresentation, albeit extremely abbreviated, of a part of
the rememberer’s N-continuous psychobiography in which past mental
events have a definite order. (This is in itself a form of N-continuity:
a memory requires a diachronic context for its full content.)

(3) The context-continuity requirement hinges on the coherence
of a memory with its psychological context. This context contains
both other memories and, for example, representations of the
rememberer's physical and social surroundings. But coherence itself
requires norms. The clash of a Q-memory with an alien context is the
violation of some of these norms. Conversely, the coherence of a
given memory with knowledge of one’s biography, as well as with
one’s present representations of one’s objective and social
environment, is compliance with such norms. So where do these norms
come from?


23 See Evans, The Varieties of Reference, pp. 236-39.
24 See, for example, Shoemaker (with Swinburne), p. 95.

A significant part of the answer to this question is that the elements
that structure our basic narratives—continuity of the body as an
object in the physical and social world, acquaintance with the
behavior of medium-sized objects and their restrictions, and so on—are
exactly those which determine a large part of the norms governing
the coherence of a memory with its psychological context. Why does
your memory of taking your children to school (see section II) not fit
into the context of my mind? Because, for example, my
“internalization” of that memory would violate in various ways the norms
that govern our basic narratives and the N-continuous psychobiographies
that occur against their backgrounds. There is no way in which my
body could have traced a spatiotemporal path that would have
endowed me, for instance, with your memory but not with memories
telling me how and why I have wound up at this strange place taking
two strange children to school, but with other memories (the ones I
actually have) instead. N-continuity, and especially the basic narrative,
provides a relatively specific “format” for the organization of
information about one's life which puts heavy restrictions on which
memories can possibly represent experiences that would fit into a past
stage of one’s psychobiography.

Since coherence is without a doubt the most crucial epistemic
criterion for the veridicality of our memories, N-continuity including
a basic narrative is a conditio sine qua non for memory as an epistemic
device that gives us access to past experiences.


V. A NEW SOLUTION TO THE CIRCULARITY PROBLEM
The circularity objection against the idea that psychological
continuity can analyze the concept of personal identity comprises two
claims:


(A) Psychological continuity can be analyzed in terms of overlapping
chains of connectedness.
(B) The most important kind of connectedness—veridical memory—
presupposes personal identity because it can be defined thus:
    (1) I am presently in a state as if remembering having a specific
    experience.
    (2) The content of that state is qualitatively identical or very similar
    to a past experience.
    (3) I had that experience.


Requirement (B) (3) introduces personal identity into the definition
of veridical memory. Therefore, veridical memory presupposes what
it is meant to coconstitute according to (A). Hence there is circularity
in analyzing personal identity in terms of psychological continuity.

The Q-memory solution attacks claim (B) by inventing a definition
of memory that does not presuppose personal identity. It does so by
replacing (3) with


(4) My memory is causally dependent, in the right kind of way, on the
past experience that is qualitatively identical or similar to my
memory.


In section II, I argued that this “solution” is not, in fact, feasible. A
definition of memory in terms of (1), (2), and (4) crucially lacks a
principle that secures the presence of an appropriate psychological
context for every recollection, a context that potentially brings out its
full content and epistemic entitlements.

In section III, I sketched the outlines of a type of psychological
continuity that cannot be analyzed in terms of overlapping chains of
psychological connectedness: N-continuity. In section IV, I argued
that N-continuity is a necessary condition for recollections to have
appropriate psychological contexts to bring out their full contents
and epistemic entitlements.

This may seem to open up the possibility of patching up the
Q-memory solution: (1), (2), (4), and N-continuity make for a
possible definition of memory that does not presuppose personal
identity. In fact, the N-continuity of an experience and a recollection
obviates the need for (4) in about the same way (4) obviates the need
for (3) (see section I). Under any plausible theory of mind, the
N-continuity of an experience and a recollection encompasses their
causal connection. We may therefore replace (4) with


(5) My state of apparent remembering must be N-continuous with that
past experience.


(1), (2), and (5) can arguably make for a definition of memory that
does not presuppose personal identity and does not fall prey to the
objection leveled against the Q-memory definition.

Once N-continuity is acknowledged, however, a much simpler
solution to the circularity objection becomes feasible, one that explains
(rather than explains away) the nagging intuition that memory cannot
really be defined without personal identity. Once N-continuity is
acknowledged, (A) can be rejected: psychological continuity cannot
be analyzed in terms of overlapping chains of (strong)
connectedness. If (A) can be denied, the possible truth of (B) seems
irrelevant. My claim is that N-continuity may analyze personal identity
noncircularly: N-continuity is a transitive, life-long kind of
psychological continuity that does not presuppose personal identity the
way continuity defined in terms of connectedness appears to.

Where does this leave memory? Does memory become irrelevant to
personal identity? My answer to this last question is both “yes” and
“no.”

I argued in section III that N-continuity does not require memory,
but that awareness of it does. If memory is required for us to be aware
of N-continuity, and if N-continuity can analyze personal identity,
then memory is required for us to be aware of our own identity over
time. Memory can thus be an epistemological criterion for personal
identity without constituting it. This is perfectly in line with Butler’s
original objection against Locke’s memory criterion, which was
meant to uncover Locke’s conflation of epistemological and
metaphysical criteria. Recognizing this, however, does not imply the
rejection of a metaphysical-cum-semantic psychological criterion of
personal identity.25

With regard to the Lockean psychological criterion, it is sometimes
remarked that it contains a “benevolent” kind of circularity that
cannot be argued away.26 The definition of memory in terms of (1),
(2), and (5) still contains that circularity. For the very concept of
memory, which is an epistemological criterion for N-continuity/
personal identity, presupposes the concept of N-continuity/personal
identity. But this is no longer the vicious circularity that undermined
the memory criterion of personal identity. For N-continuity is not
codefined in terms of memory, so that memory does not presuppose
itself.

MARC SLORS 
Nijmegen University


25 The proposed criterion of personal identity is only partly Lockean in spirit. It
is Lockean because it is a psychological criterion. But Locke’s criterion was
thoroughly driven by empiricist concerns; personal identity could not, according to
Locke, be something that we might not even be conscious of. N-continuity, by
contrast, is something we are often not aware of at all.

26 For instance, by McDowell in “Reductionism and the First Person,” and by
Shoemaker in his “Personal Identity and Memory,” this JOURNAL, LVI, 22 (October
22, 1959): 868-82.

THE MOST FAMOUS EQUATION*

A typical undergraduate physics textbook, written by Joseph W. Kane
and Morton M. Sternheim,1 says:


The equation E = mc² is perhaps the most famous equation of
twentieth-century physics. It is a statement that mass and energy are two
forms of the same thing, and that one can be converted into the other
(ibid., p. 493).


With the first of these sentences, concerning the fame of Albert
Einstein’s equation, it is difficult to disagree. But the second
sentence, which characterizes mass and energy as interconvertible and
“two forms of the same thing,” suggests that some care will need to be
exercised in interpreting the famous equation. Indeed, it is difficult
to find a scientific equation whose ontological implications have been
misunderstood so widely and in so many ways. I shall begin by looking
at some of the issues that arise when interpreting the famous
equation. Along the way, I shall examine briefly some of the ways in which
one could become confused by the equation’s common
interpretations—not only in ordinary physics texts, but also by some very
distinguished physicists and philosophers. I shall then try to set the
record straight.2


I. SOME ISSUES
Let us begin to lay out some of these interpretive issues by taking 
seriously the textbook's remark that mass "can be converted into" 


* My thanks to Arthur Fine for comments on the penultimate draft of this essay,
and to Alexander Rosenberg for his encouragement.

1 Physics (New York: Wiley, 1978).

2 To keep things simple and focused, I shall restrict myself throughout to
interpretation of the special theory of relativity, leaving aside considerations
arising from the general theory, quantum mechanics, and so forth.

energy and vice versa. Strictly speaking, this does not mean merely
that when mass m disappears, a quantity E of energy takes its place,
where E = mc². To say that some mass is "converted" into energy is to
say that the mass turns into the energy that replaces it—that there is
then a certain parcel of energy that used to be the mass.

For example, after a radioactive nucleus decays, there is often said
to be a "mass defect": the sum of the masses of the daughter bodies
is less (by Δm) than the mass of the original nucleus. Some (Δm) of
the original mass is said to have been "converted" into the kinetic
energy E of the daughter bodies, where E = (Δm)c². Since c (light’s
speed in a vacuum) is so large, a very small mass can be "turned into"
a great deal of energy. For instance, when a tritium nucleus (one
proton, two neutrons) decays into a helium-3 nucleus (two protons,
one neutron) along with an electron and an antineutrino, the
tritium's mass exceeds the sum of the products' masses by a small
quantity that is “equivalent” to about 0.0186 million electron volts of
energy. Has mass turned into energy, or merely disappeared and
been replaced by an "equivalent" quantity of energy?
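
To give a sense of the magnitudes involved, the following minimal sketch turns the 0.0186 MeV quoted above into the corresponding mass difference by way of Δm = E/c². It is only an illustration: the constants are the standard SI values (the electron mass is rounded), and the function and variable names are assumptions introduced here, not anything taken from the textbook under discussion. The calculation is neutral on the interpretive question just raised; it gives the "rate of exchange" and nothing more.

```python
# Illustrative sketch: the mass "equivalent" to a given energy, delta_m = E / c**2.
# All names below are hypothetical; only the physical constants are standard values.

C_M_PER_S = 299_792_458.0          # speed of light in a vacuum (exact SI value)
JOULES_PER_EV = 1.602176634e-19    # one electron volt in joules (exact SI value)
ELECTRON_MASS_KG = 9.1093837e-31   # electron rest mass (rounded), for comparison only


def mass_equivalent_kg(energy_mev: float) -> float:
    """Return the mass in kilograms equivalent to an energy given in MeV."""
    energy_joules = energy_mev * 1.0e6 * JOULES_PER_EV
    return energy_joules / C_M_PER_S ** 2


# 0.0186 MeV is the figure quoted in the text for tritium decay.
delta_m = mass_equivalent_kg(0.0186)
print(f"delta m = {delta_m:.3e} kg")
print(f"fraction of one electron mass = {delta_m / ELECTRON_MASS_KG:.4f}")
```

On these assumptions the mass difference comes out at roughly 3.3 × 10⁻³² kg, a few hundredths of the mass of a single electron.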

In classical physics, at least, a body's mass is often characterized as
its “quantity of matter.”3 Mass, on this view, is a property, whereas
matter is a stuff, a substance. If mass is the quantity of matter, then
perhaps the physics text should have said that matter is converted into
energy. Among the physical processes called to mind by this
interpretation are a caterpillar being transformed into a butterfly and
liquid water turning into steam.

Energy, on this interpretation of the famous equation, would have
to be like matter in being a stuff. Classical physics sometimes seems to
treat energy in that way. Occasionally, energy conservation is stated as
the principle that energy is “neither created nor destroyed” rather
than as the principle that an isolated system's total energy remains
constant. A parcel of energy is sometimes described as being
converted from one form into another or as moving from one place to
another—for example, as originally being stored as chemical energy
in a battery, then passing through the electric and magnetic fields,
eventually passing to a resistor (perhaps a flywheel, which possesses
the parcel as kinetic energy), and finally (let us suppose) passing via


3 “Quantity of matter is a measure of matter that arises from its density and volume
jointly. ... I mean this quantity whenever I use the term ‘body’ or ‘mass’ in the
following pages”—Definition 1 in Isaac Newton, The Principia: Mathematical
Principles of Natural Philosophy, I. B. Cohen and A. Whitman, trans. (Berkeley:
California UP, 1999), pp. 403-04.

friction to the surrounding air molecules as heat energy (that is, as
the air molecules’ kinetic energies of random motion).

This view of energy gained support in the late nineteenth century.
Like other forms of potential energy, electric or magnetic potential
energy had originally been considered as having an arbitrary zero
level and as being possessed by a system as a whole rather than as
having some definite distribution among the system’s constituents.
But with the rise of James Clerk Maxwell's electromagnetic theory,
electric and magnetic potential energy were reinterpreted as
electromagnetic field energy. Accordingly, a field of zero strength was
taken to possess zero energy, and Maxwell’s electromagnetic theory
determined a field’s energy density in space. All of this supported the
view of energy as a substance that can flow; such motion requires that
the energy at one place and time be the self-same thing as the energy at
a certain other place and time.

On the other hand, many notable followers of Maxwell (such as
Heinrich Hertz and Oliver Heaviside4) had grave doubts regarding
what Heaviside called the “thingness” of energy. To mention just one
source of disquiet: Maxwell’s equations fail to fix uniquely the vector
describing the flow of energy (specifically, the energy-flux density)
through the electromagnetic field. The flux of the Poynting vector
over an entire surface enclosing a volume is ordinarily interpreted as
giving the rate at which electromagnetic-field energy is leaving the
enclosed volume. But numerically the same result is given by the flux
of the Poynting vector plus any vector field possessing zero
divergence. While these two formulas for the energy-flux density agree on
the net energy flow into or out of a given volume, such as the volume
enclosed by a given spherical surface, the two formulas may disagree
regarding the energy flow across a given open surface, such as the
spherical surface’s top hemisphere. (For instance, if the
divergence-free vector field is characterized at every point by the same
upward-pointing vector, then it portrays some additional energy as flowing
into the volume through the bottom surface, as well as an equal
additional energy flow out of the volume across its top surface. These
two flows together have no net effect on the total enclosed energy.)
The energy flow across an open surface is left entirely undetermined
by classical electromagnetic theory; it might be termed “superfluous
theoretical structure.” Perhaps, then, there is no fact of the matter
regarding the energy flow across an open surface—because energy is

4 Electrical Papers, Volume 2 (Boston: Copley, 1925), p. 527.

not a stuff (any more than velocity is).5 Likewise, perhaps a body's
mass is not the quantity of its matter, but rather merely a quantitative
property determining the body's resistance to being accelerated by a
force. (This was Maxwell’s6 own understanding of mass.) On this
interpretation of mass and energy, we could not join Elie Zahar7 in
interpreting Einstein's famous equation as characterizing the
underlying identity of what ordinarily appear to be two distinct substances:


According to the accepted mechanistic view, ordinary “ponderable”
matter, i.e., the extended hard "stuff" with which we are all familiar,
constitutes the most fundamental layer of physical reality. ... [M]atter
and energy were both conserved, but each separately from the other.
Einstein showed that the two levels could be regarded as identical.
The stuff which appears to the senses as hard extended substance and
the quantity of energy which characterises a process are in fact one and
the same thing (ibid., pp. 261-62).


If energy and mass do not indicate quantities of stuffs the parcels of
which have continuing identities over time, stuffs that can be
transformed into one another in something akin to the sense in which a
caterpillar turns into a butterfly, then it is difficult to see how there
could be any reasonably strict sense in which mass turns into energy.

Let us turn to another interpretive issue. Our physics text said that
E = mc² “is a statement that” mass can be converted into energy and
vice versa. But whether there happen to be, or even could physically
possibly be, any processes that make mass disappear, putting energy
in its place, is a different question from what the “rate of exchange”
between mass and energy would be in such processes. For instance,
that certain nuclei are unstable, and that some mass is replaced by an
“equivalent” quantity of energy when those nuclei decay, is a result of
other laws, not just E = mc².

That E = mc² tells us nothing about whether any physical processes
allow energy and mass to be interconverted is exploited by Margaret
Morrison8 in a recent passage interpreting our favorite equation:


Does [E = mc²] mean that mass and energy are the same thing? [I]n
order for this relation to imply a complete identity, it seems that a total


5 I examine this issue and a host of related problems in my Locality, Fields, Energy,
and Mass: An Introduction to the Philosophy of Physics (Malden, MA: Blackwell,
forthcoming), chapter 5.

6 The Scientific Letters and Papers of James Clerk Maxwell, Volume 2, P. M. Harman,
ed. (New York: Cambridge, 1995), pp. 396, 811; and Harman, “Edinburgh Philosophy
and Cambridge Physics,” in Harman, ed., Wranglers and Physicists (Manchester:
University Press, 1985), pp. 204-24, especially pp. 222-23.

7 Einstein's Revolution: A Study in Heuristic (La Salle, IL: Open Court, 1989).

8 Unifying Scientific Theories (New York: Cambridge, 2000).

conversion of mass into energy should be possible. However, such a
conversion is not possible, because of the conservation of baryon
number, which says that the total number of these particles cannot change,
they can be converted into other baryons, but cannot disappear entirely.
Baryons include, among other things, protons and neutrons, which
make up most of the mass of ordinary matter, and therefore most of the
mass of ordinary matter is not available for conversion into energy. So
we are left in the rather odd position of asserting an equivalence
between mass and energy, but not an identity that would allow one to be
fully transformed into the other (ibid., p. 182).


These remarks correctly recognize that E = mc² is logically consistent
with limitations imposed by other laws on the processes that allow
mass to be converted into energy. But these remarks may mislead us
regarding what these limitations actually are. Baryon number
conservation does not prevent baryons (such as protons and neutrons) from
being “fully transformed” into energy (or vice versa). A proton has
baryon number +1 and an antiproton has baryon number −1, so
their mutual annihilation (with the creation of photons, quanta of
light possessing energy and having baryon number 0) does not violate
baryon number conservation. Of course, antiprotons are not
"ordinary matter," and so Morrison is correct insofar as she is suggesting
that without the assistance of extraordinary matter, most of the mass
of ordinary matter is not available for conversion into energy.
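
The bookkeeping behind this point can be made explicit with a small sketch. It is a toy illustration only: it checks baryon number and nothing else (not energy, momentum, or charge), it follows the idealization used above in which annihilation yields photons, and the table and function names are hypothetical ones introduced here.

```python
# Toy bookkeeping for baryon number only; no other conservation law is checked.
# The dictionary and function names are illustrative assumptions.

BARYON_NUMBER = {
    "proton": 1,
    "neutron": 1,
    "antiproton": -1,
    "photon": 0,
    "electron": 0,
}


def total_baryon_number(particles):
    """Sum the baryon numbers of a list of particle names."""
    return sum(BARYON_NUMBER[name] for name in particles)


def conserves_baryon_number(before, after):
    """True if the total baryon number is the same before and after."""
    return total_baryon_number(before) == total_baryon_number(after)


# Proton-antiproton annihilation into photons: total is 0 before and 0 after.
annihilation_ok = conserves_baryon_number(["proton", "antiproton"],
                                          ["photon", "photon"])

# A lone proton vanishing into photons: total is 1 before and 0 after.
lone_proton_ok = conserves_baryon_number(["proton"], ["photon", "photon"])

print(annihilation_ok, lone_proton_ok)
```

The first process passes the check and the second fails it, which is the sense in which baryon number conservation restricts which masses are available for conversion without forbidding total conversion as such.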

Nevertheless, I do not see why Morrison insists that for mass and
energy to be the same thing, a total conversion of mass into energy
must be physically possible. Even if mass and energy were essentially
the same (like water and steam), it could be a natural law that 90% is
the maximum fraction of the initial mass that a process can turn into
energy. Or it could simply be that there is no process, permitted by
the laws of nature, that turns mass entirely into energy, just as it could
be that there is no process, permitted by the laws of nature, that
enables diamond to be turned into graphite (though both are forms
of carbon).

We have now arrived at the central claim of the textbook passage:
that according to E = mc², “mass and energy are two forms of the
same thing.” Some authors try to slide from the interconvertibility of
mass and energy to their identity. Here, for instance, is Max Jammer:9


As we have seen, the conversion of an electron-positron pair, for
example, into gamma radiation or its mirror phenomenon is an incontestable
experimental confirmation of the assertion of the theory of relativity that


9 Concepts of Mass (Cambridge: Harvard, 1961).

mass and energy are mutually and completely interconvertible. This
state of affairs raises the following questions: Are not the two entities
which are interchangeable essentially the same? Is not what is generally
spoken of as an equivalence in reality an identity? Are therefore not
“mass” and “energy” merely synonyms for the same physical reality,
which may perhaps be termed “massergy” (ibid., p. 184)?


Jammer clearly intends the answer to be “yes,” but from the fact that
one thing can be converted into the other, it simply does not follow
that the two are actually different forms of the same thing. James
Prescott Joule, for instance, famously suggested that all forms of
energy (heat, kinetic energy, gravitational potential energy, chemical
bond energy, electrical energy, and so on) are interconvertible, but
he did not go so far as to hold that they are all different forms of the
same thing. In 1843, he showed experimentally that (what he called)
the “mechanical equivalent of heat” is (in modern units) 4.18 joules
of energy per calorie of heat. In other words, Joule demonstrated that
a 1 kg mass falling through a distance of 42.67 cm contributes exactly
enough mechanical energy (in doing work by, say, turning a stirring
paddle immersed in water) to add 1 calorie of heat to the water. But
he saw this “equivalence” as involving how much of one thing replaces
a given quantity of the other: their fixed “rate of exchange.”
He did not understand the mechanical equivalent of heat as we do
today: as a conversion factor, that is to say, a way to change our units
for measuring the same thing, because heat is just a particular form
of energy.¹⁰ Yet this is how “mass-energy equivalence” is often
understood. In the words of a fine relativity text by Edwin Taylor and John
Archibald Wheeler:¹¹


[J]oules and kilograms are two units, different only because of
historical accident, for one and the same kind of quantity, mass-energy.
The conversion factor $c^2$, like the factor of conversion from miles to
feet, can today be counted, if one wishes, as a detail of convention, rather
than as a deep new principle (ibid., p. 197).
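
The arithmetic behind the two “conversion factors” just discussed can be checked with a short sketch. It is only an illustration: the value of g is assumed to be standard gravity (Joule's own figure may have differed slightly), and the function names are invented here rather than taken from any of the texts cited.

```python
# Assumed constants: standard gravity and the SI-defined speed of light.
G_STANDARD = 9.80665       # m/s^2 (an assumption, not a figure from Joule)
C = 299_792_458.0          # m/s


def work_from_fall(mass_kg, height_m, g=G_STANDARD):
    """Mechanical energy in joules released by a mass falling through a height."""
    return mass_kg * g * height_m


def energy_equivalent(mass_kg, c=C):
    """Energy in joules corresponding to a mass in kilograms via E = m c^2."""
    return mass_kg * c ** 2


# Joule's "mechanical equivalent of heat": 1 kg falling through 42.67 cm.
joules_per_calorie = work_from_fall(1.0, 0.4267)
print(f"1 kg falling 42.67 cm releases {joules_per_calorie:.3f} J")

# The factor c^2 read as a unit conversion from kilograms to joules.
print(f"1 kg corresponds to {energy_equivalent(1.0):.4e} J")
```

On Joule's own reading the first number is a rate of exchange between two different things; on the Taylor and Wheeler reading the second number is no more than a change of units.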


As Einstein¹² himself writes:


Mass and energy are therefore essentially alike; they are only different
expressions for the same thing (ibid., p. 45).


10 See Yehuda Elkana, The Discovery of the Conservation of Energy (Cambridge:
Harvard, 1974). Einstein writes: “[E]ach absorption or release of energy brings
about, respectively, an increase or decrease of the mass of the body involved. Energy
and mass appear as equivalent quantities in the same way that heat and mechanical
energy do”; see “On the Development of Our Views Concerning the Nature and
Constitution of Radiation,” in The Collected Papers of Albert Einstein, Volume 2, A. Beck,
trans., pp. 379-94, on p. 386.

11 Spacetime Physics (San Francisco: Freeman, 1966).

12 The Meaning of Relativity (Princeton: University Press, 1953).


But how is mass-energy equivalence in this ontological sense (not
merely in terms of a “rate of exchange”) to be understood?

One problem arises from the fact that under a standard interpretation,
relativity theory denies the objective reality of various properties
that we ordinarily assign to material bodies (such as their length
and velocity) and to events (such as their separation in space and
their separation in time). Each of these quantities is frame dependent;
none is “Lorentz invariant” (that is, the same in every inertial frame
of reference). Only what is the same in every inertial frame is a
genuine feature of reality. The value that any frame-dependent quantity
assumes in a given inertial frame reflects not just reality, but also
that reference frame's own particular perspective. The Lorentz invariant
quantities are exactly those which depend only on how the
universe really is, uncontaminated by any contribution from us in
describing the universe.

Consider a body's mass m. (Here I mean what is sometimes called
the body's “rest mass,” though, as I shall explain, I prefer to call it
simply the body's “mass.”) This quantity is Lorentz invariant, and
hence objectively real. But the body's energy is not, since its energy
depends on its speed, and its speed v is plainly frame dependent.¹³ In
particular, relativity says that a body's energy E is $m\gamma c^2$, where
$\gamma = 1/\sqrt{1 - (v^2/c^2)}$. Indeed, Heaviside made a similar point in a
classical context, where a body's kinetic energy is $(1/2)mv^2$. One
reason that Heaviside doubted energy's substantiality is that a body's
energy depends on its velocity and so is frame dependent, and
Heaviside believed in what he called “the relativity of motion”: that
there are in reality no absolute velocities since only what is the same
in every inertial frame can be a genuine feature of reality (op. cit., pp.
525-26, 521).¹⁴ Likewise, the path taken by energy flowing in a system,
being frame dependent, lacks objective reality.

A body's energy E and momentum p (a vector, though I suppress any
vector notation, the magnitude of which, in relativity theory, is $m\gamma v$),
taken together, form an invariant, mc:


$$\sqrt{(E/c)^2 - p^2} = \sqrt{(m\gamma c)^2 - (m\gamma v)^2} = m\gamma\sqrt{c^2 - v^2}$$
$$= m\left(1/\sqrt{1 - (v^2/c^2)}\right) c \sqrt{1 - (v^2/c^2)} = mc$$

13 At this point, you might wonder how E can possibly equal $mc^2$ when E is frame
dependent but m and c are the same in every inertial frame. Stay tuned.

14 See also Heaviside, Electromagnetic Theory, Volume 1 (London: Benn, 1922), p. 75.

To appreciate this invariant, let us use space-time geometry. At a
given moment, a body is associated with an energy-momentum 4-vector
P. The lengths of P's three spatial components (along the x, y, and
z axes) are given by the components $p_x$, $p_y$, and $p_z$ of the body's
momentum, which equal $m\gamma v_x$, $m\gamma v_y$, and $m\gamma v_z$ (where $v_x$, $v_y$, and $v_z$
are, respectively, the x, y, and z components of the body's velocity). In
other words, the body's momentum p is the 3-vector formed by P's
spatial components. The temporal component of P is $m\gamma c$, which is
E/c. In Euclidean geometry, the square of the hypotenuse's length is,
of course, the sum of the squares of the other two sides' lengths. But
in space-time geometry, the square of a 4-vector's length is the square
of its temporal component's length minus the square of the length of
the 3-vector formed by its spatial components. Accordingly, the
length of the 4-vector P is $\sqrt{(E/c)^2 - p^2}$, which (as we just saw)
equals the invariant mc. In short, P's length is an invariant quantity,
though its decomposition into energy and momentum components is
frame dependent, just as the electromagnetic field appears as different
combinations of electric and magnetic fields from the standpoints
of different inertial frames. Relativity thus unifies energy and momentum
in exactly the same sense as it unifies the electric and magnetic
fields.¹⁵
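
A small numerical sketch may make the invariance vivid. It treats motion along a single axis, uses the definitions given above ($E = m\gamma c^2$, $p = m\gamma v$), and evaluates the length of P for several speeds, standing in for several inertial frames. The function names and the sample speeds are illustrative assumptions.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def gamma(v):
    """Lorentz factor for speed v (in m/s)."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def energy_momentum(m, v):
    """Return (E, p) for a body of mass m moving at speed v along one axis."""
    g = gamma(v)
    return m * g * C ** 2, m * g * v


def four_vector_length(E, p):
    """Length of the energy-momentum 4-vector: sqrt((E/c)^2 - p^2)."""
    return math.sqrt((E / C) ** 2 - p ** 2)


m = 1.0  # kg
for fraction in (0.0, 0.1, 0.5, 0.9):
    E, p = energy_momentum(m, fraction * C)
    # E and p change with the speed, but the ratio printed last stays at 1.
    print(fraction, E, p, four_vector_length(E, p) / (m * C))
```

The energy and momentum columns differ from row to row, while the last column stays at 1 up to rounding: only the combination, not either component, is frame independent.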

Classically, a body's momentum p = mv, whereas relativistically,
$p = m\gamma v$. By calling a body's $m\gamma$ its “relativistic mass” M, momentum in
relativity remains mass times velocity, and so an analogy can be drawn
between relativistic and classical physics. Whereas m is invariant, M is
not, since $\gamma$ is obviously frame dependent in being a function of v.
One version of our favorite equation, $E = Mc^2$, then follows from
$E = m\gamma c^2$. The celebrated “$E = mc^2$,” featuring the body's (invariant)
mass m, holds only when the body is at rest (that is, when its p = 0),¹⁶
since $E = mc^2$ then follows from the more general


$$\sqrt{(E/c)^2 - p^2} = mc$$


When the body is at rest (v = 0), its $\gamma = 1$, and so its “relativistic mass”
$m\gamma$ equals its mass m. That is why “m” is sometimes called the body's


15 That p is conserved in every inertial frame entails that E is so conserved as well,
since p in one frame is a function of p and E in another. But this entailment alone
does not suffice to unify momentum and energy (that is, to reveal them to be
nothing more than aspects of the same single real thing), any more than momentum,
energy, and mass are unified in classical physics by the fact that momentum
conservation follows from energy conservation and mass conservation in every
inertial frame.

16 Thus the question posed in footnote 13 is answered: $E = mc^2$ holds only for E
in the p = 0 frame.

“rest mass”: its “relativistic mass” when it is at rest. Because the
so-called “relativistic mass” is not an invariant quantity (and the term
‘rest mass’ refers back to ‘relativistic mass’), the best thing to do in
order to avoid confusing frame-dependent quantities with invariant
ones is just to avoid the terms ‘relativistic mass’ and ‘rest mass’, and
instead to stick solely with ‘mass’ for the invariant quantity symbolized
m.¹⁷

Now we can ask a question regarding the proper interpretation of
$E = mc^2$. Given that mass is a real property (since it is Lorentz
invariant) whereas energy is not, how can mass and energy be the
same thing (“mass-energy”) in the same sense that distance in miles
measures the same real thing as distance in feet? Moreover, in what
sense can mass be converted into energy (as when a tritium nucleus
decays) when mass and energy are not on a par in terms of their
reality? We shall look into this.

On the basis of $E = mc^2$ (when p = 0), Einstein¹⁸ says that “[t]he
mass of a body is a measure of its energy content” (in the p = 0
frame). But beware: this remark may mislead us by suggesting that
energy content is a basic, objective fact whereas mass is merely some
sort of indicator, symptom, or manifestation of energy. In fact, however,
mass rather than energy is the Lorentz invariant, and hence real,
quantity.¹⁹ Accordingly, I cannot agree with remarks like Bertrand
Russell's:²⁰


[W]e must abandon attempts to say what energy is. We must say simply:
there is something quantitative, to which we give the name “energy”; this
something is very unevenly distributed in space; there are small regions
in which there is a great deal of it, which we call “atoms”, and are those
in which, according to older conceptions, there was matter. Mass is
only a form of energy. [I]t is energy, not matter, that is fundamental in
physics (ibid., pp. 27, 291).


17 That Einstein did not regard “rest mass” and “relativistic mass” as standing on
the same ontological footing is evident from his many remarks urging others to
introduce as “mass” only a quantity independent of motion. See Arthur Fine,
“Appendix” to John Earman, “Against Indeterminacy,” this JOURNAL, LXXIV, 9
(September 1977): 535-38, p. 538; and Lev Okun, “The Concept of Mass,” Physics
Today, XLII, 6 (June 1989): 31-36.

18 “Does the Inertia of a Body Depend Upon Its Energy Content?” in The Collected
Papers of Albert Einstein, Volume 2, pp. 172-74, see p. 174. See also his Relativity: The
Special and the General Theory (New York: Crown, 1931), pp. 55-56.

19 Of course, there is an objective fact regarding energy content in the p = 0 frame,
but this is a cheap sort of objectivity: it holds in all frames because it refers to a
specific frame.

20 Human Knowledge: Its Scope and Limits (New York: Simon and Schuster, 1948).




How can mass really be nothing but “concentrated energy”²¹ that
moves through space, “a form of bound energy,”²² when mass rather
than energy is the invariant quantity? Because energy is not Lorentz
invariant, it seems difficult to endorse these remarks from Werner
Heisenberg:²³


Since mass and energy are, according to the theory of relativity,
essentially the same concepts, we may say that all elementary particles consist
of energy. This could be interpreted as defining energy as the primary
substance of the world (ibid., p. 67).


This kind of talk filters down to monographs for guiding physics
teachers, such as one by Robert M. Cotts and Robert W. Detenbeck,²⁴
which says, “Energy appears in still another wonderful disguise as
mass itself” (ibid., p. 66).

Here is a final issue regarding the ontological significance of
Einstein's famous equation. In any inertial frame, a closed system's
total energy and total momentum (as defined relativistically) are each
“conserved.” That is, in any inertial frame, the system's total energy
does not change as time passes, and likewise for its total momentum.
(But these totals differ in different inertial frames; total energy and
total momentum are frame dependent and therefore not real.) Since
E and p are conserved and we just saw an equation by which m is
determined by E and p, a closed system's total m must be conserved
in any inertial frame. But we saw earlier that when mass “turns into”
energy (as in the decay of a tritium nucleus), there is a “mass defect”
and so, apparently, mass fails to be conserved. (In these cases, textbooks
commonly refer to some “missing mass” $\Delta m$.) We must look
into how mass conservation can be reconciled with “mass defects.”


II WHAT IS MASS? 
In classical physics, a body's mass is often interpreted as the amount
of some “stuff” (matter) of which the body is made. In relativity,
however, this interpretation cannot be correct. That is because properties
that represent the quantity of some “substance” must obey the
following principle: the total quantity of some sort of stuff in a system
is the sum of the various quantities of that stuff belonging to the


21 Hermann Weyl, Space, Time, Matter (New York: Dover, 1922), p. 200.

22 Heinz Pagels, The Cosmic Code (New York: Simon and Schuster, 1982), p. 37.

23 Physics and Philosophy (London: George Allen and Unwin, 1958). The difficulties
for understanding energy as a locally well-defined, conserved quantity become
even more substantial as we pass to the general theory of relativity. See Carl Hoefer,
“Energy Conservation in GTR,” Studies in History and Philosophy of Modern Physics, 31B,
2 (June 2000): 187-99.

24 Matter in Motion (Seattle: Washington UP, 1966).

system's parts (where those parts are finite in number, nonoverlapping,
and together exhaust the system).²⁵ For example, since the
density of a whole is not the sum of the densities of its parts, density
does not measure the amount of some stuff, and similarly for temperature
and velocity. On the other hand, mass in classical physics is
“additive” in this way. But in relativity, mass is not additive.

To see this, let us work in an inertial frame in which p = 0. (There
always exists such a frame, and since mass is invariant, the result in
that frame is applicable to any other inertial frame.) Then (as I have
already mentioned)


$$E = mc^2$$


Consider a system of finitely many constituents, where each exerts on
the others only negligible forces when they are not in contact, such
as a gas consisting of many molecules.²⁶ Since the system's total
energy is the sum of the energies $E_1$, $E_2$, ... of its constituents, we have


$$m = (1/c^2)[E_1 + E_2 + \cdots]$$

Recall that for any constituent (say, the ith one),

$$E_i = m_i \gamma_i c^2 = m_i c^2 / \sqrt{1 - (v_i^2/c^2)}$$

Now for any x where $x^2 < 1$,

$$1/\sqrt{1 - x} = 1 + (1/2)x + [(1 \cdot 3)/(2 \cdot 4)]x^2 + [(1 \cdot 3 \cdot 5)/(2 \cdot 4 \cdot 6)]x^3 + \cdots$$


Let x be $v_i^2/c^2$. If $v_i$ is low compared to c, then the terms with $x^2$
and higher powers of x are negligible, and so we can use the
approximation


$$E_i \approx m_i c^2 + (1/2) m_i v_i^2$$


Notice that this is just $m_i c^2$ plus the ith constituent's classical kinetic
energy. Substituting for $E_1$, $E_2$, ...,


$$m \approx (1/c^2)[m_1 c^2 + (1/2) m_1 v_1^2 + m_2 c^2 + (1/2) m_2 v_2^2 + \cdots]$$


and so 


25 See Norman Robert Campbell, Foundations of Science (New York: Dover, 1957),
pp. 282-83, 286 [originally Physics: The Elements (New York: Cambridge, 1920)]. For
related discussion, see Helen Morris Cartwright, “Amounts and Measures of
Amount,” Noûs, IX, 2 (May 1975): 143-64, pp. 147-49.

26 This restriction means that there is no potential energy, there are no fields, and
so forth; in classical terms, the only energy is kinetic energy. I relax this restriction
in chapter 8, section 5 of my Locality, Fields, Energy, and Mass and go on to examine
the mass possessed by the electromagnetic field.

$$m \approx [m_1 + m_2 + \cdots] + (1/c^2)[(1/2) m_1 v_1^2 + (1/2) m_2 v_2^2 + \cdots]$$


Hence, the system's mass m exceeds the sum of its constituents'
masses. (They differ by an amount that reflects the constituents'
kinetic energies in the frame where the system has zero total momentum.
This will shortly be important.) So we cannot think of a body's
mass as the amount of matter that forms it, where matter is a “stuff.”
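
The failure of additivity can be illustrated numerically. The sketch below assumes a toy “gas” of two equal bodies moving in opposite directions along one axis, so that the total momentum is zero; the masses, speeds, and function names are illustrative choices, not values from the text. It compares the exact system mass with the low-speed estimate derived above.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def gamma(v):
    """Lorentz factor for speed v (in m/s)."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def system_mass(masses, velocities):
    """Invariant mass of a system of free bodies moving along one axis."""
    total_energy = sum(m * gamma(v) * C ** 2 for m, v in zip(masses, velocities))
    total_momentum = sum(m * gamma(v) * v for m, v in zip(masses, velocities))
    return math.sqrt((total_energy / C) ** 2 - total_momentum ** 2) / C


def low_speed_estimate(masses, velocities):
    """Sum of the masses plus (classical kinetic energy)/c^2."""
    kinetic = sum(0.5 * m * v ** 2 for m, v in zip(masses, velocities))
    return sum(masses) + kinetic / C ** 2


masses = [1.0, 1.0]                  # kg (hypothetical)
velocities = [1e-3 * C, -1e-3 * C]   # equal and opposite, so total p = 0

exact = system_mass(masses, velocities)
estimate = low_speed_estimate(masses, velocities)
print("sum of constituent masses:", sum(masses))
print("system mass (exact):      ", exact)
print("system mass (low-speed):  ", estimate)
```

The system mass comes out larger than the sum of the constituents' masses, and the excess matches the constituents' total kinetic energy divided by $c^2$ to within the neglected higher-order terms.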

Note that the system's mass is the sum of its constituents' relativistic
masses $m_i \gamma_i$ in the frame where p = 0. That is because in that frame,


$$m = (1/c^2)[E_1 + E_2 + \cdots] = (1/c^2)[m_1 \gamma_1 c^2 + m_2 \gamma_2 c^2 + \cdots]$$
$$= m_1 \gamma_1 + m_2 \gamma_2 + \cdots$$


But since a body's relativistic mass is frame dependent, it is not the
body's quantity of any sort of real “stuff.” So the fact that the system's
(invariant) mass is the sum of its constituents' relativistic masses in
the p = 0 frame does not show that the system's mass is its total
quantity of some sort of “stuff” of which it is made.

Bearing in mind that mass is not additive, consider the following
argument, some version of which appears in most relativity texts.
Suppose two bodies, each of mass $m_0$, crash into each other and stick
together, forming one body at rest of mass $m_f$. In the words of one
fine text by Robert Resnick and David Halliday:²⁷


[Although] the total energy E of the system of particles will be
conserved [and] relativistic mass is conserved in this collision, rest mass [what
I am calling simply “mass”] is not; $m_f$, the rest mass after the collision, is
greater than $2m_0$, the rest mass before the collision. After the collision,
no kinetic energy remains. In place of the “lost” kinetic energy,
there appears internal (thermal) energy, recognizable by the rise in
temperature of the colliding particles. [By $E = mc^2$,] (increase in
thermal energy) = (increase in rest mass)($c^2$). Thus we see that the
decrease in kinetic energy for this isolated system is balanced by a
corresponding increase in mass energy (ibid., p. 110).


So the text says that although the system is “isolated,” mass is not
conserved! This is not correct. In this argument, the sum $2m_0$ of the
masses before the collision is shown to be unequal to the mass $m_f$ of
the body formed in the collision. But $2m_0$ is not “the rest mass before
the collision” (that is, the mass of the system before the collision)
because mass is not additive. The argument that mass is not conserved
rests on the mistaken assumption of mass's additivity. To say that


27 Basic Concepts in Relativity and Early Quantum Theory (New York: Wiley, 1985, 2d
ed.). I have substituted ‘$m_f$’ for their symbol for the resultant body's mass.




energy is conserved whereas mass is not helps to make energy seem
like some sort of real stuff, leading to remarks like Heisenberg's
(already quoted). Of course, it is also simply confusing for the text I
have quoted to say on one page that the isolated system's rest mass is
not conserved and on the very next page that “if we consider a closed
system then we may regard the rest mass of the body (or of the
system) as constant.”
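
The point about the collision can be checked with a short sketch. It assumes two bodies of equal mass approaching along one axis at equal and opposite speeds; the particular mass and speed, and the names used, are illustrative assumptions. The mass of the two-body system before the collision is computed from the system's total energy and momentum, not by adding the two masses.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def gamma(v):
    """Lorentz factor for speed v (in m/s)."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def total_energy_momentum(bodies):
    """Total (E, p) for a list of (mass, velocity) pairs on one axis."""
    energy = sum(m * gamma(v) * C ** 2 for m, v in bodies)
    momentum = sum(m * gamma(v) * v for m, v in bodies)
    return energy, momentum


def invariant_mass(energy, momentum):
    """System mass from sqrt((E/c)^2 - p^2) = m c."""
    return math.sqrt((energy / C) ** 2 - momentum ** 2) / C


m0 = 1.0       # kg, mass of each colliding body (hypothetical)
v = 0.6 * C    # speed of each body before the collision (hypothetical)

energy_before, momentum_before = total_energy_momentum([(m0, v), (m0, -v)])
system_mass_before = invariant_mass(energy_before, momentum_before)

# Afterwards there is one body at rest; energy conservation fixes its mass.
final_mass = energy_before / C ** 2

print("sum of the two masses (2 * m0):  ", 2 * m0)
print("mass of the system before impact:", system_mass_before)
print("mass of the body after impact:   ", final_mass)
```

The last two figures agree, so the system's mass is conserved; only the sum of the two bodies' masses differs from the resultant body's mass.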

If a body's mass is not its total quantity of some sort of stuff of
which it is made, what is a body's mass? A body's mass is the property
it possesses which determines the acceleration it undergoes in response
to a force: $F = p'$ (as in classical physics), where in relativity,
$p = m\gamma v$ and $p'$ is the rate at which p is changing. When referring to
a body's “mass,” then, we must be thinking of that body as a thing that
can feel a force and respond to it (by moving) as a unit. Nevertheless,
no macroscopic body is elementary; any macroscopic body is also a
system of bodies. Its motion, then, is nothing but the motions of its
constituents, and these motions are determined by their masses and
the forces that they feel. The remarkable fact is that the law of nature
by which the constituents' motions are determined by their masses
and the forces they feel is the same as the law by which the macroscopic
body's motion is determined by its mass and the forces it feels:
$F = p' = (m\gamma v)'$. In other words, the law “scales up.”²⁸

Let us see precisely why this fact is so remarkable. Macroscopic
bodies are presumed to have elementary constituents (that is,
constituents having no constituents themselves) and there is a law relating
the force felt by an elementary constituent, the constituent's
motion in response to that force, and a single parameter characterizing
the constituent. This need not have been; we can imagine a universe
in which a constituent's motion in response to forces is determined
in different ways by several of its properties (such as its shape, volume,
and chemical activity) rather than there being a single parameter (its
“mass”) that captures everything about the constituent affecting its
motion in response to a force. But from the fact that such a parameter
exists for elementary bodies, it does not follow that a single
parameter characterizing the macroscopic body is all that is needed
to allow its motion to be related to the force that it experiences. If a
body's “mass” is defined as that single parameter, then even given the
remarkable fact that the fundamental constituents of macroscopic


28 A similar remark appears in Richard Feynman, Robert Leighton, and Matthew
Sands, The Feynman Lectures on Physics, Volume 1 (Reading, MA: Addison-Wesley,
1963), p. 19-2. For related discussion, see David Bohm, The Special Theory of Relativity
(New York: Routledge, 1996), pp. 82-85, 110-18.

bodies have masses, it is a remarkable fact that macroscopic bodies
have masses too.

Furthermore, from the fact that macroscopic bodies have constituents
with masses and that macroscopic bodies have masses too, it
does not follow that the law by which the elementary constituents'
motions are determined by their masses and the forces they feel is the
same as the law by which the macroscopic body's motion is determined
by its mass and the forces it feels. If a body's “mass” is defined
as the quantity with which it is associated that plugs into the particular
law relating an elementary body's motion to the forces it feels, then
once again, it is a remarkable fact that macroscopic bodies have masses
too.

In short, neither of these conceptions of what mass is requires that
macroscopic bodies have masses, even considering that their elementary
constituents do. Furthermore, given either of these conceptions
of mass and that a macroscopic body has a mass, there is nothing
inevitable about its mass being the sum of its elementary constituents'
masses (as is the case classically but not relativistically). So these
conceptions of mass should inoculate us against the temptation to
think that a body's “mass” is defined as the quantity of matter
composing it.

It is certainly convenient for us that in order to predict a macroscopic
body's motion in response to a force exerted upon it from
outside it (an “external force”), we do not need to determine how
each of its constituents responds to the various forces exerted upon
it by other constituents (“internal forces”). We can simply think of the
macroscopic body's mass m as if it were concentrated in a point
particle located at the body's “center of mass” and then determine
this particle's response to the external force. It is also convenient that
the law governing this response is the same as the law governing the
responses of the body's elementary constituents to the forces they
feel. Otherwise, our investigations into the behavior of macroscopic
bodies would not necessarily have left us well equipped to understand
the behavior of their elementary constituents.

Let us see exactly why in classical physics, Newton's second law
(F = ma, where F is a force felt by a body of mass m and a is the body's
acceleration in response to that force) “scales up” in that it governs
the motions not only of the elementary bodies, but also of the center
of mass of a system of those bodies. To simplify the discussion,
suppose there are only three elementary bodies, each exerting forces
on the others. Let $m_i$ be the ith body's mass, and let $a_{ij}$ be the
component of the ith body's acceleration that is caused by the force
$F_{ij}$ exerted by the jth body on the ith body. By Newton's second law,
$F_{ij} = m_i a_{ij}$. By Newton's third law (“Every action has an equal and
opposite reaction”), $F_{ij} = -F_{ji}$, and so $m_i a_{ij} = -m_j a_{ji}$. Now consider
bodies 2 and 3 as forming a single system of bodies. Let us see why
that system's motion is likewise governed by Newton's second law.
The system's center of mass is a kind of average of the positions of the
system's constituents. Each constituent's contribution to the average
is proportional to its mass. So the system's center of mass is a weighted
average of its constituents' positions, weighted by their masses. If $x_i$
is the position of the ith body, then the position x of the center of
mass of the system composed of bodies 2 and 3 is given by


$$x = [m_2 x_2 + m_3 x_3]/[m_2 + m_3]$$


For Newton's second law to govern this system's motion would be for
the force exerted on the system (namely, the sum of the forces
exerted on the system's constituents by body 1, the body outside the
system: $F_{21} + F_{31}$) to equal the system's mass ($m_2 + m_3$; in classical
physics, mass is additive) multiplied by the acceleration a of its center
of mass. From the above equation for x, it follows that (with $a_2$ and $a_3$
the accelerations of bodies 2 and 3)


$$a = [m_2 a_2 + m_3 a_3]/[m_2 + m_3]$$

So Newton's second law governs the system's motion if and only if

$$F_{21} + F_{31} = (m_2 + m_3)a = (m_2 + m_3)[m_2 a_2 + m_3 a_3]/[m_2 + m_3]$$
$$= m_2 a_2 + m_3 a_3$$

By Newton's second law applied to the constituents,

$$m_2 a_2 = F_{21} + F_{23}$$
$$m_3 a_3 = F_{31} + F_{32}$$

So Newton's second law governs the system's motion if and only if

$$F_{21} + F_{31} = F_{21} + F_{23} + F_{31} + F_{32}$$

But this is obviously true since (by Newton's third law)

$$F_{23} = -F_{32}$$


So the system consisting of bodies 2 and 3 behaves in relation to body
1 in just the way that one elementary body behaves in relation to
another. (By expanding the system to encompass more and more
elementary bodies, we could work up to the conclusion that macroscopic
bodies obey Newton's second law.) Thus, it takes the cooperation
of Newton's third law for Newton's second law to scale up.
Since either or both these laws could have been different, there is
nothing logically mandatory about the fact that the law relating a
body's motion to the forces it feels scales up.
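
The three-body argument can be traced through numerically. The sketch below works along one axis with arbitrary, hypothetical masses and pairwise forces; the only constraint built in is Newton's third law, which the helper `force` enforces by construction. All names are illustrative.

```python
# Hypothetical masses (kg) of the three elementary bodies.
masses = {1: 2.0, 2: 3.0, 3: 5.0}

# pair_forces[(i, j)] is the force (N) exerted by body j on body i.
# Only one member of each pair is stored; the other follows from the third law.
pair_forces = {(2, 1): 4.0, (3, 1): -1.5, (3, 2): 7.0}


def force(i, j):
    """Force exerted by body j on body i, with F_ij = -F_ji built in."""
    if (i, j) in pair_forces:
        return pair_forces[(i, j)]
    return -pair_forces[(j, i)]


def acceleration(i):
    """Second law for an elementary body: net force on i divided by m_i."""
    net = sum(force(i, j) for j in masses if j != i)
    return net / masses[i]


system = (2, 3)  # treat bodies 2 and 3 as one system; body 1 is external

external_force = sum(force(i, 1) for i in system)
system_mass = sum(masses[i] for i in system)  # additive in classical physics
center_of_mass_acceleration = (
    sum(masses[i] * acceleration(i) for i in system) / system_mass
)

print("external force on the system:   ", external_force)
print("(m2 + m3) * a of center of mass:", system_mass * center_of_mass_acceleration)
```

The two printed values agree because the internal forces cancel in pairs. If `force` were changed so that the force of 2 on 3 no longer equaled minus the force of 3 on 2, the agreement would fail, which is the sense in which the second law needs the third in order to scale up.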

It is likewise the case in relativity theory that a collection of bodies
behaves as a single body with a mass m located at the collection's
center of mass. According to relativity, the system's center of mass is
the weighted average of its constituents' positions, weighted by their
energies (or, what comes to the same thing, their relativistic masses):


$$x = [m_1 \gamma_1 x_1 + m_2 \gamma_2 x_2 + \cdots]/[m_1 \gamma_1 + m_2 \gamma_2 + \cdots]$$


(This formula is restricted to the case where the constituents exert
forces on one another only when they are in contact, in other words,
when no fields are involved. At the start of this section, I restricted
myself to this case for the sake of simplicity.) For example, if the
system is closed (that is, feels no external forces), then (since it
behaves as a single body) its center of mass undergoes no acceleration.
Notice that the weights in the weighted average are frame-dependent
quantities (since the ith weight involves $\gamma_i$, which is a
function of the ith body's velocity, which is obviously frame dependent).
So the location of the system's center of mass is not Lorentz
invariant.

Now we are in a position to appreciate the key point.


III THE CONVERSION OF MASS TO ENERGY IS NOT 
A PHYSICAL PROCESS (OR VICE VERSA) 


The frame in which the system's total p = 0 is the frame in which the
system's center of mass is at rest. As we have already seen, the system's
mass increases as the kinetic energies of its constituents increase in
the p = 0 frame (that is, as its constituents move about more quickly
relative to the system's center of mass). That is because in this frame,
the system's


$$m = E/c^2 = (1/c^2)[E_1 + E_2 + \cdots]$$


and so any increase $\Delta E$ in the constituents' total kinetic energy
contributes $\Delta E/c^2$ to the system's mass, though it has no effect on
any constituent's mass. Imagine, for example, a ball of hot gas, its
molecules whizzing around randomly. We add to the constituents'
total kinetic energy by heating the gas. We thereby increase the gas's
mass. In what sense is this the “conversion” of energy into mass?

Suppose that initially we think of the gas as a collection of molecules,
each with its own mass. In other words, we treat each of these
bodies as individually feeling forces and being accelerated by them,
its acceleration depending on its mass. Accordingly, we characterize
the heat as having boosted various molecules' kinetic energies, but
not their masses. Suppose we then change our perspective by considering
the ball of gas as a single body that feels external forces and
moves about as a unit. From this perspective, the kinetic energy
contributed by the heat becomes part of the gas's mass. This “conversion”
of energy into mass is not the transformation of one kind of
stuff into another, since neither energy nor mass is a kind of stuff or
measures the amount of some kind of stuff (energy because it is not
Lorentz invariant, mass because it is not additive). But more importantly,
we have just seen that this “conversion” of energy into mass is
not any kind of real physical process taking place in nature. We
“converted” energy into mass simply by changing our perspective on the
gas from treating it as many bodies to treating it as a single body.
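
The two ways of keeping the books can be set side by side in a short sketch. It works in the p = 0 frame and simply records, for each perspective, where an added quantity of heat is entered: under “kinetic energy” or under “mass.” The function name, the perspective labels, and the 1000 J figure are illustrative assumptions.

```python
C = 299_792_458.0  # speed of light in m/s


def describe_heating(heat_joules, perspective):
    """Return (change in mass, change in kinetic energy) as booked on a perspective.

    "many bodies": the molecules' masses are unchanged; the heat is booked as
    kinetic energy of the molecules relative to the gas's center of mass.
    "single body": the gas as a whole stays at rest in the p = 0 frame, so the
    same heat is booked as an increase of heat / c^2 in the gas's mass.
    """
    if perspective == "many bodies":
        return 0.0, heat_joules
    if perspective == "single body":
        return heat_joules / C ** 2, 0.0
    raise ValueError(f"unknown perspective: {perspective!r}")


heat_added = 1000.0  # joules (hypothetical)
for perspective in ("many bodies", "single body"):
    delta_mass, delta_kinetic = describe_heating(heat_added, perspective)
    print(f"{perspective}: mass change {delta_mass:.4e} kg, "
          f"kinetic energy change {delta_kinetic:.1f} J")
```

Nothing about the gas differs between the two rows; only the column in which the same 1000 J is recorded changes.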

Let us see that again. Suppose we begin by treating the gas as a
single body. The body is heated. Heat energy flows into it and its mass
increases by an equivalent amount. It looks like energy is being
converted into mass; fluid, gossamer energy has “solidified” or “congealed”
into matter, “the extended hard ‘stuff’ with which we are all
familiar” (in Zahar's words). I argue, however, that this is not a real
process; rather, it is just an artifact of the perspective we have
adopted. No such “conversion” occurs on a different perspective. Let
us begin the gas's story again and this time, let us treat the gas as
many bodies. We find no energy being transformed into matter as the
heat is being added to the gas; so long as we continue to regard the
gas as many bodies, none of those bodies increases its mass while the
gas is heating up. The heat energy goes into their kinetic energies
relative to the gas's center of mass.

So on the first perspective, energy was converted into mass,
whereas on the second, no such conversion occurred. Furthermore,
at whatever point in the story we choose to switch our perspective on
the gas, we can make it appear that at that point, energy is being
transformed into mass, even if at that point heat is no longer being
added to the gas! Again let us begin by treating the gas as many
bodies, and let us maintain this perspective throughout the heating of
the gas. Suppose the heat source is then shut off. Nothing more is
now happening to the gas. But we decide at this point to shift our
perspective in telling the gas’s story; henceforward we treat the gas as
a single body. In this shift, the gas molecules’ total kinetic energy (in
the p = 0 frame) contributes an equivalent amount to (and so
“becomes part of”) the gas’s mass. But obviously, no physical change
accompanies this “transformation”; again, nothing is happening to
the gas. This “conversion” of energy into mass is not a physical
process. Thus, whether and when a conversion of energy into matter
occurs in the story of the gas depends on the perspective from which
we elect to tell that story and any shifts of perspective we make in the
course of telling it.
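
The two perspectives on the gas can be illustrated numerically. What follows is only a minimal sketch under stated assumptions, not part of the original argument: the "gas" is reduced to two molecules, units are chosen so that c = 1, and the function names and the particular masses and momenta are hypothetical. It computes the sum of the molecules' masses (the many-bodies perspective) and the mass of the gas treated as a single body, obtained from its total energy and total momentum.

```python
import math

C = 1.0  # speed of light; units with c = 1 are an assumption made for readability


def body_energy(mass, momentum):
    """Total relativistic energy of one body: sqrt((m c^2)^2 + (|p| c)^2)."""
    p_squared = sum(component ** 2 for component in momentum)
    return math.sqrt((mass * C ** 2) ** 2 + p_squared * C ** 2)


def mass_as_single_body(bodies):
    """Mass of the whole system treated as one body: sqrt(E^2 - (|P| c)^2) / c^2."""
    total_energy = sum(body_energy(m, p) for m, p in bodies)
    total_momentum = [sum(p[i] for _, p in bodies) for i in range(3)]
    total_p_squared = sum(component ** 2 for component in total_momentum)
    return math.sqrt(total_energy ** 2 - total_p_squared * C ** 2) / C ** 2


def sum_of_masses(bodies):
    """Sum of the individual masses (the many-bodies perspective)."""
    return sum(m for m, _ in bodies)


# Hypothetical two-molecule "gas", described in its p = 0 frame.
# Each entry is (mass, momentum vector).
cold_gas = [(1.0, (0.0, 0.0, 0.0)), (1.0, (0.0, 0.0, 0.0))]
hot_gas = [(1.0, (0.75, 0.0, 0.0)), (1.0, (-0.75, 0.0, 0.0))]

for label, gas in (("cold", cold_gas), ("hot", hot_gas)):
    print(label, sum_of_masses(gas), mass_as_single_body(gas))
```

In this sketch, heating leaves each molecule's mass (and so the sum of the masses) unchanged, while the mass of the gas treated as a single body comes out larger for the hot gas than for the cold one. Nothing physical distinguishes the two numbers reported for the hot gas; they differ only in which function, that is, which perspective, is applied to the same state.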

The distinction between forms of “internal” and “external” energy
is often drawn by relativity texts.²⁹ But the point is seldom made that
because this distinction is a scientific convention, so is the “conversion”
of energy to mass or vice versa. As far as science is concerned,
the line between bodies, between “internal” and “external,” is a
convention (that is, not built into the universe, but rather drawn onto
it by us) because of the remarkable fact that I emphasized in the
previous section: that any system is characterized by a mass determining
the way it responds (by moving as a unit) to the forces it feels as
a whole (that is, to external forces). As Einstein³⁰ says: “Every system
can be looked upon as a material point as long as we consider no
processes other than changes in its translation velocity as a whole”
(ibid., p. 225).

A shift in “perspective” (not reference frame!), in what are being
treated as single bodies, obviously takes place in the course of the
textbook discussion (quoted earlier) of two bodies colliding and
subsequently sticking together. The text treats the system as initially
consisting of two bodies, each with its own mass. But the text regards
the system after the collision as forming a single body. The “conversion”
of energy into mass in this case is an artifact of this shift in
perspective.

Sometimes, a shift in perspective is much subtler. Return to the
tritium nucleus (one proton, two neutrons) that decays into a helium-3
nucleus (two protons, one neutron) and an electron and an antineutrino
that fly off at high speed. There is a “mass defect” in that
the masses of a helium-3 nucleus, an electron, and an antineutrino
add up to less than the mass of a tritium nucleus. The missing mass
is said to have been “converted” to the kinetic energies of the resulting
bodies. But this “conversion” of mass into energy is not real; it is
an illusion produced by a subtle shift in our perspective. (The
transformation of the tritium’s neutron into a proton, an electron, and an
antineutrino is, of course, a real occurrence.) We treated the system
as initially forming a single body: a tritium nucleus. But we treated the
system after the decay as consisting of three bodies, each with its own
mass. The system’s mass after the decay is the same as the system’s
mass before the decay. There is no “mass defect” here; mass is
conserved. The “mass defect” appears to arise from the fact that the
sum of the three masses after the decay is less than the system’s mass
before the decay (the difference reflecting the three bodies’ kinetic
energies in the p = 0 frame). But the sum of the three masses after
the decay is less than the system’s mass after the decay. Mass is not
additive, and our expectation that it is additive (arising because we
expect it to measure the amount of some stuff) leads us to refer to a
“mass defect,” to ask where the “missing mass” has gone, and to
conclude that it has turned into energy. The “mass defect” results not
from some physical transformation of matter-stuff into energy-stuff,
but rather from our illicitly trying to view the system from two
different “perspectives” at the same time.


29 See, for example, Resnick and Halliday, pp. 111-12.
30 “Elementary Derivation of the Equivalence of Mass and Energy,” Bulletin of the
American Mathematical Society, XLI, 4 (April 1935): 223-30.

Thus, the conservation of mass does not conflict with the original
tritium nucleus’s mass exceeding the sum of the postdecay bodies’
masses. The “mass defect” is not real; it is produced not by the decay
of the tritium nucleus, but by our treating the postdecay system as a
collection of bodies even while treating the predecay system as a
single body. The fact that Δm of the system’s initial mass “becomes”
energy (Δm)c² when we think of the postdecay system as a collection
of bodies, each with its own mass, does not mean that mass is really
nothing but energy or that mass and energy are like distance in feet
and in miles: different ways of measuring the same property. Again,
the “conversion” of mass into energy occurs because we have shifted
our perspective, not because the nucleus has decayed. Unlike the
conversion of a caterpillar into a butterfly, the conversion of mass
into energy (or vice versa) is not a physical process.
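
The bookkeeping behind the “mass defect” can be written out explicitly. The following is a sketch only, assuming the p = 0 frame used throughout; the symbols M (the system's mass when it is treated as a single body), m_i, K_i, and γ_i (the mass, kinetic energy, and Lorentz factor of postdecay body i) are illustrative labels, not notation taken from the text.

```latex
% Labels M, m_i, K_i, \gamma_i are introduced here for illustration only.
% Everything is evaluated in the p = 0 frame of the system.
\begin{align*}
  E_i &= m_i \gamma_i c^2 = m_i c^2 + K_i
      && \text{(energy of postdecay body $i$)} \\
  M c^2 &= \sum_i E_i = \sum_i m_i c^2 + \sum_i K_i
      && \text{(system as a single body, total momentum zero)} \\
  \Delta m &\equiv M - \sum_i m_i = \frac{1}{c^2} \sum_i K_i
      && \text{(the so-called mass defect)}
\end{align*}
```

On this bookkeeping M is the same before and after the decay, and (Δm)c² is simply the bodies' total kinetic energy in the p = 0 frame: the difference between describing the postdecay system as one body and describing it as several.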

A body’s energy mγc² is its kinetic energy plus mc². Since this latter
energy depends only on its mass, we can think of its mass as “associated
with” a certain quantity of energy: a “mass energy” mc². Accordingly,
it might sometimes help to think of relativity as having turned
the classical law of energy conservation into the conservation of
“mass-energy” by having added a term for the energy “frozen” as a
body’s mass. In Einstein’s³¹ words:


Before the advent of relativity, physics recognised two conservation laws
of fundamental importance, namely, the law of conservation of energy
and the law of conservation of mass; these two fundamental laws appeared
to be quite independent of each other. By means of the theory
of relativity they have been united into one law (ibid., p. 54).


But the fact that, in totaling a system’s energy, we must include not
only the more familiar forms of energy but also terms of the form mc²
does not mean that mass really is energy (or “massergy”) or “that mass
is now viewed as a form of energy”³² like ice is a form of water. Mass
is a real property whereas energy is not. Mass-energy is not some sort
of real stuff that is neither created nor destroyed.


31 Relativity: The Special and the General Theory.

Just as there is only a single object, the electromagnetic field, which
in different inertial frames appears as different combinations of
electric and magnetic fields, so there is only a single real property, the
body’s mass, which in different frames appears as different combinations
of its energy and momentum. This is the sense in which relativity
theory unifies energy, momentum, and mass.

MARC LANGE 

University of Washington 


32 Marshall Spector, Methodological Foundations of Relativistic Mechanics (Notre
Dame: University Press, 1972), p. 151.


DOES PERCEPTION HAVE A NONCONCEPTUAL CONTENT?* 


The question posed in my title is one that has been vigorously
debated in philosophy for almost twenty years now. In one
form or another, the idea that perceptual experience has a
content that is nonconceptual is found in the writings of, among
others, José Bermúdez, Tim Crane, Fred Dretske, Gareth Evans,
Susan Hurley, myself, and Michael Tye.¹ The idea has been strongly
opposed by John McDowell, Sonia Sedivy, and, most recently, by Bill
Brewer.² The question has generated so much discussion because the
possibility, or otherwise, of nonconceptual content is inseparable
from some fundamental issues about the individuation of conceptual
content, about the nature of concept possession, about the nature of
rationality, about the relation between animal and human perception,
and even about our conception of objectivity, which all turn, in
part, on the possibility of nonconceptual content in perception. That
my question is inseparable from these wider issues I shall try to make
plausible as we proceed. My specific aim here is to argue further for
the thesis that perception does indeed have nonconceptual content.³


* The present paper was delivered at the Certosa di Pontignano, Siena, in May
1999, under the auspices of the Università degli Studi di Siena. I am grateful to
participants at the meeting for discussion, and to later audiences at Bonn, Essen,
Oxford, and New York University. I have retained the largely informal style of a
conference talk. Special thanks to Bill Brewer and Stephen Schiffer for valuable
comments. Some of the work on the present text was carried out while I held a
Leverhulme Research Professorship; once again I thank the Leverhulme Trust for
this invaluable support.

1 Bermúdez, The Paradox of Self-Consciousness (Cambridge: MIT, 1998), especially
chapter 3; Crane, “The Nonconceptual Content of Experience,” in Crane, ed., The
Contents of Experience: Essays on Perception (New York: Cambridge, 1992), pp. 136-57;
Dretske, Naturalizing the Mind (Cambridge: MIT, 1995), especially chapter 1, pp.
19ff., and earlier in Knowledge and the Flow of Information (Cambridge: MIT, 1981);
Evans, The Varieties of Reference (New York: Oxford, 1982), see index entry for
“Conceptual and Non-conceptual Content”; Hurley, Consciousness in Action (Cambridge:
Harvard, 1998), especially chapter 4; my “Analogue Content,” Proceedings of
the Aristotelian Society, Supplementary Volume LX (1986): 1-17, and A Study of Concepts
(Cambridge: MIT, 1992), chapter 3; Tye, Ten Problems of Consciousness (Cambridge:
MIT, 1995), especially pp. 137ff. A rather different but related use of the notion of
nonconceptual content is found in the work of Adrian Cussins; see his paper “The
Connectionist Construction of Concepts,” in Margaret Boden, ed., The Philosophy of
Artificial Intelligence (New York: Oxford, 1990), pp. 368-440.

2 McDowell, Mind and World (Cambridge: Harvard, 1994), especially Lecture III
and Afterword, Part II; Sedivy, “Must Conceptually Informed Perceptual Experience
Involve Non-Conceptual Content?” Canadian Journal of Philosophy, XXVI (1996):
413-31; and Brewer, Perception and Reason (New York: Oxford, 1999), especially
chapter 5.


I. THE CLAIMS OF NONCONCEPTUAL CONTENT

The nonconceptual content of perception that is the topic of this
paper is meant to be one species of representational content. It is
content that is evaluable as correct or as incorrect. So we are not
concerned here with whether there are nonrepresentational conscious
properties of perceptions, what are sometimes called sensational
properties. The question I am now addressing concerns solely
the class of representational contents of perception. It is the issue of
whether some of the representational content of perception is
nonconceptual.

The discussions of recent years have focused on the fine-grained
representational content of experience. When you look at the new
Guggenheim Museum in Bilbao, or see a new abstract sculpture, or
the face of a person, you see each of these objects as having a quite
specific shape and size. Similarly, you see them as having quite
specific shades of colors, surface textures, and contours. Equally,
when you hear a musical tone, there is a sense in which you perceive
its pitch. You may not recognize the tone (you may not have absolute
pitch), but you can discriminate that pitch from many, many others
if you are asked to compare it with another. This discrimination is
based on the way the tone sounds to you. All parties to these discussions
have acknowledged the fine-grained character of this representational
content. What has been at issue is not its existence, but its
character.

We shall not do justice to the fine-grained phenomenology of
experience if we restrict ourselves to those contents which can be
built up by referring to the properties and relations that the perceived
objects are represented by the experience as possessing. We
must, in describing the fine-grained phenomenology, make use of the
notion of the way in which some property or relation is given in the
experience. The same shape can be perceived in two different ways,
and the same holds for the shape properties, if we regard them as
within the representational content of experience. Ernst Mach’s example
of one and the same shape that can be perceived either as a
square or as a regular diamond is a familiar example. These are not
different shapes. The shape of an object need not alter when it
moves, and an object can be perceived either as a square, or as a
diamond, in either of the standard orientations relative to the
perceiver.


3 I shall confine myself in this paper to the fundamental philosophical motivations
for acknowledging nonconceptual content in perception, and the philosophical
challenges such acknowledgment must address. There are many further
important issues about the nature and role of nonconceptual content which arise in
the light of psychological experiments on perception, and on action; discussion of
them will have to be left to another occasion.

The need to use the notion of the way in which something is
perceived is by no means special to the perception of spatial properties
and relations. Suppose middle C and the F-sharp just above it are
played simultaneously on a piano. The interval thus sounded may be
heard either as an augmented fourth, or as a diminished fifth. In the
former case, the higher tone of the interval is heard as the seventh of
the presumed tonic scale.⁴ In the latter case, it is heard as the fourth
of that scale. In neither case need these music-theoretic descriptions
be ones known to the perceiver, as part of his personal-level conceptual
repertoire. We use them in a plausible theory about the nature of
the two ways in which the interval can be perceived. It is the way itself,
and not the materials of music theory, which enters the description of
the subject’s experience. Even one and the same time interval may be
perceived in two different ways. It is perceived in one way when it is
perceived as the familiar time interval between your switching on
your computer and its emitting the start-up sound. It may be perceived
in another way when the same interval is exactly that of the
silence in a complete, but empty measure (bar) of a Haydn string
quartet, in which case it may be heard as composed of two beats.

So, in characterizing the fine-grained content of experience, we
need the notion of the experience representing things or events or
places or times, given in a certain way, as having certain properties or
as standing in certain relations, also as given in a certain way. Henceforth
I use the phrase the content of experience to cover not only which
objects, properties, and relations are perceived, but also the ways in
which they are perceived. The ways I have mentioned all contribute
to the representational content of the experience. That is, when
something is perceived in one of these ways, the claim that the object
really is the way it is experienced as being is one which has a
correctness condition.

Here, now, are five at least moderately intuitive claims about
nonconceptual content which one might expect a believer in nonconceptual
content to defend. Some of these claims are, prima facie,
incompatible with the position that the representational content of
experience is exclusively conceptual. Before we start discussing them,
here is a brief, wholly dogmatic, statement of each of the claims.


4 On the musical case, see the fine contribution of Mark DeBellis, Music and
Conceptualization (New York: Cambridge, 1995), especially chapter 2.

(1) The nonconceptual content of perceptual experience contributes
to making available to a thinker various perceptually based
concepts. Only a thinker who has a perceptual experience with a
certain kind of nonconceptual representational content can employ
such perceptual-demonstrative concepts as that shape, that texture, that
interval of time. The nonconceptual content of experience also helps
to make available such general, nondemonstrative recognitional concepts
as the concept regular-diamond shaped and the concept red. It also
makes available more specific, but still general, recognitional concepts
such as the concept Cambridge blue and the concept yellow ochre.
Perceptual states with nonconceptual content make these general
concepts available to a thinker by providing the canonical, noninferential
basis for the application of these concepts to things given in
experience. The nonconceptual contents that make available these various
perceptually based conceptual contents cannot, however, be identified
with any of the conceptual contents that they make available.

These claims are all to be construed as constitutive. What it is to
have the perceptually based concepts is to be elucidated, philosophically,
in terms of the relations of those concepts to the nonconceptual
content of experience.

(2) Experiences with finer-grained nonconceptual contents can
also provide an empirical basis for the acquisition, and so enter the
causal explanation of the learning, of such general concepts as
regular-diamond shaped.

(3) An experience with a certain nonconceptual content can make
rational a judgment of a conceptual content suitably related to the
nonconceptual content that the experience represents as correct.

(4) Some of the nonconceptual content of our experience can be
identical with the representational content of the experience of
creatures that either possess no concepts, or possess only a set of
concepts far more rudimentary than our own.

(5) The nonconceptual content of experience can enter the
explanation of features of intentional action. There is an important
discussion in psychology of whether this content, or its precursors,
enter the explanation of action only via memory, rather than control
concurrent action on-line.⁵ But all agree that it has an explanatory
role in some cases.


5 For an illuminating overview, see A. David Milner and Melvyn A. Goodale, The
Visual Brain in Action (New York: Oxford, 1995).

The formulation of these five claims should make clear some of the
wider ramifications of the otherwise apparently rather local issue of
whether perception has a nonconceptual representational content.

We need to be very clear what we mean by ‘conceptual’. I shall be
taking it that conceptual content is content of a kind that can be the
content of judgment and belief. Concepts are constituents of those
intentional contents which can be the complete, truth-evaluable,
contents of judgment and belief. Conceptual content and concepts I
take to have identities conforming to, indeed answerable to, Gottlob
Frege’s criterion of identity for senses. Complete contents p and q are
distinct if and only if it is possible for someone for whom the question
arises rationally to judge that p without judging that q, and even while
judging not-q. So the content ‘This country is Italy’ is distinct from the
content ‘This country is this country’; the content ‘The floor-plan is
square’ is distinct from ‘The floor-plan is a regular diamond’; the
content ‘Your lost pen is there’ (pointing under a pile of papers) is
distinct from ‘Your lost pen is where you last used it’. Concepts C and
D are distinct if and only if there is some completing content Σ such
that the complete content Σ(C) is distinct from the complete content
Σ(D), or, in other words, if and only if there is some content Σ(C)
such that someone for whom the question arises can rationally judge
Σ(C) without judging Σ(D). So the pairs


this country                      Italy
square                            regular diamond
there (pointing appropriately)    where you last used the pen


are all pairs of distinct concepts by this criterion. As these examples
show, concepts as intended here may be demonstrative or indexical.

On this approach, any further connections between concepts and,
for instance, language have to be earned by further argument. If
someone holds that a concept user must have a language in which he
can express at least some of his concepts, that is a substantive,
nondefinitional thesis that needs to be established. The same applies
to the thesis that concepts, so characterized, conform to Evans’s
“generality constraint” (op. cit.).

It should, however, be uncontroversial that any content that can be
expressed in language by the use of an indicative sentence, including
sentences containing indexicals and demonstratives, will be a conceptual
content. That follows in the presence of the less controversial
premise that any utterance of an indicative sentence, in a given
context, expresses a content that is of a kind that could also be the
content of a belief or judgment.

The Fregean criterion as I have formulated it applies only to those
contents which can be the content of judgment or belief. It would be
a fair observation that the Fregean criterion as I have formulated it is
an instance of a more general criterion, under which we can rightly
say that there are distinct ways W₁ and W₂ in which a shape, or an
interval, is given in perception simply because it is not obvious (even
at the level of perception) that something given in way W₁ is the same
shape, or interval, as something given in way W₂. So there certainly is
some intuitive notion of “mode of presentation” that applies
uncontroversially at the level of perceptual content. That more general
notion of mode of presentation is not, however, the level at which the
question here is posed. My question is whether the ways involved in
perceptual content are the same as the conceptual contents that
feature in the content of judgment and belief. The stipulations and
terminology introduced so far do not settle this substantive question.

So much for minimal stage-setting. I turn now to argue in support
of at least the claims (1)-(4), and, if it is agreed that they are true,
to consider why they are true.⁶


II. DISTINCT FROM CONCEPTUAL CONTENT, BUT MAKING IT AVAILABLE

We can label as conceptualists those who claim that the representational
content of experience is always conceptual. Conceptualists are
committed to the falsity of the first claim (1), with its consequence
that there is a level of representational content that cannot be elucidated
in terms of conceptual content. Conceptualists have been
tempted to say that the phenomena for whose description nonconceptual
content has been invoked are better described by invoking
conceptual contents containing perceptual-demonstrative concepts,
or certain recognitional concepts. I take these two kinds of concepts
in turn, and ask whether they can really meet the conceptualist’s
needs.

(a) Perceptual demonstratives. A perceptual-demonstrative concept
that shape is a concept made available by an experience as of a
particular shape. This perceptual-demonstrative concept will, in the
context of its use in thought, refer to a shape, and the shape may be
finely individuated. Similarly for the perceptual-demonstrative concepts
that shade, that sound, that taste. Can the conceptualist say that it
is these concepts which enter the representational content of experience,
and thereby fully capture the content that I have been claiming
is nonconceptual?


6 The discussion of (5) would involve more delving into the psychological literature
than is feasible here. It is also significant that the case for the existence of
nonconceptual content in perception can be made without reliance on any particular
thesis relating it to action.

An initial problem with this version of the perceptual-demonstrative
route is the presence of the general concept in the perceptual
demonstrative, as the concept shape features in that shape. No one can
enjoy perceptual states, or indeed any other conscious mental states,
with the conceptual content that shape in their representational content
unless he possesses the general concept shape. It is, though, quite
implausible that one must have that general concept in order to
perceive objects as having various specific shapes. Nor is it at all clear
that two perceivers must have some more or less general predicative
concept in order for them both to see some object as having the same
specific shape. One perceiver may think of a presented shape as that
rectangle; the other perceiver may think of it as that straight-sided figure.
This difference in their thoughts need not prevent them from seeing
it in exactly the same way.

It is in any case not always required that a good, successfully
referring perceptual demonstrative contain some general concept.
For those who think it must, a useful intermediate case that forces the
softening of such a hard position is provided by the demonstrative
there. One difference between there and that is that the former must
refer to a location, rather than anything present at that location. But
the demonstrative there does not do this by having the general concept
of place as one of its constituents. In fact, it is not at all obvious
that a thinker, in order to have the rather unsophisticated capacity to
think of a perceived location as there, must also have the general
concept place or location in his conceptual repertoire. Perhaps he
must have the resources for introducing it, on the basis of concepts
he already has; but that is different from already deploying it.

If this intermediate case is granted, it could hardly be denied that
we could conceive of a family of demonstratives, modeled on there:
maybe that-C for the color apparently at a location, that-T for the
surface texture there, and so forth. But I do not think this would be
a stable stopping point as a limit on the possibility of perceptual
demonstratives unsupplemented with general concepts, nor as a
suitable limit for the conceptualist position. This is so, first, because
we do not actually have such devices in our conceptual repertoire.
Our experience nevertheless still has the determinate, fine-grained
character concerning textures, shades, and the rest all the same. The
fact that we could introduce such conceptual devices just serves to
emphasize that what makes such devices available, namely, the rich
representational content of experience, exists in advance of the
conceptual apparatus it makes possible. That makes it implausible
that it can be identified with anything in that conceptual apparatus.

Second, there seem to be cases in which a wholly unsupplemented
perceptual demonstrative ‘that’ still secures reference in a suitable
perceptual context. You may think, when viewing a vase in a museum,
‘That is beautiful’, and be referring in thought to the color, not to the
shape or the surface texture. You may think, watching a Balinese
dancer, ‘That’s beautiful’, and be referring to the movement or the
gesture rather than anything else. Perhaps sophisticated thinkers can
serve up general concepts if asked ‘What kind of thing or event are
you thinking of?’ But it would be a stretch to insist that some general
concept must have been entering the singular demonstrative component
of their perception or their thought all along. It seems, for
instance, that someone could be introduced to the general concept
timbre, applicable to sounds, by his first having an experience leading
him to judge ‘That’s beautiful’, referring specifically to the timbre of
(say) a clarinet. It may be that our listener only later applies the
concept timbre to the instance he had already perceived and thought
about. (‘That sound’ could be too unspecific to capture what he
experienced as beautiful.)

The natural treatment of such cases is that the type of the perceptual
demonstrative involved is individuated by a demonstrative element
(which would be expressed in an utterance by ‘that’), an element
to which I shall return, together with some particular way that enters
the representational content of the perceptual experience that makes
the whole perceptual demonstrative available. The way in question is
the way in which the shade, or movement, or shape, or whatever is
referred to by the demonstrative, is perceived.

For any given way in which a particular quality may be perceived,
there is a specific kind such that that way is intrinsically a way for
something of that kind to be perceived. The way in which a shape is
presented (for example, as a regular diamond) is intrinsically a way
for a shape to be perceived, and not anything else. The same applies
to the way in which a texture, or a musical interval, is perceived. It
does not make any sense to suggest that a shape might be perceived
in the way a musical interval is perceived. In synaesthesia, the subject
does not literally perceive a vowel sound in the way in which a color
can be perceived. On reflection, it should not be a surprise that for
each way, there is a specific kind such that the way is intrinsically a way
for something of that kind to be perceived. Since the way contributes
to a correctness condition, this general connection between ways and
kinds is equivalent to this proposition: that for an object to have the
property a given way presents it as having, the object must meet a
certain condition, and this condition is applicable only within objects
of a given kind.

This general connection between ways and kinds is important
because it helps to contribute to an explanation of why a general
concept is not always needed to supplement the ‘that’ in a perceptual
demonstrative to secure manifest reference to one kind of quality of
the perceived object or event, rather than another. The kind of
property referred to (a shape property rather than a sound property,
say) is fixed by the perceptual way that contributes to the individuation
of the perceptual demonstrative in question. Provided that the
demonstrative ‘that’ is linked to a given way, there will be a general
kind such that the reference of the demonstrative must be of that
general kind.

Ways of being perceived are inherently general. For instance, the
shapes of two different objects may be perceived in exactly the same
way, either on one occasion or on different occasions. By contrast,
perceptual-demonstrative reference to shapes themselves (to continue
with that example) proceeds fundamentally via the shape of
particular presented individuals. When a thinker thinks ‘That would
make a good logo’, referring to a shape, he is characteristically
thinking of the shape as given in the perception of some particular
object in his experience, and the truth conditions of his thought
concern the perceived shape of that particular object. Maybe one can
refer perceptually demonstratively to the shape common to several
objects, each of whose shape one perceives, and perceives to be
the same for all of them; thus ‘That shape, instantiated equally in all
of those objects, would make a good logo’. But this still involves a
tie to particulars, several rather than just one. This particularity,
which seems distinctive of, and essential to, perceptual demonstratives,
is one reason that one cannot simply identify the perceptual-demonstrative
sense with the relevant way of being perceived. The
way, which is general, lacks the tie to particulars which is characteristic
of the perceptual demonstrative.

The link to a particular perceived individual that has the property
demonstratively thought about is a characteristic of the demonstrative
element in the perceptual-demonstrative concept. It would not
be right to regard a perceptual-demonstrative concept that shape as
thought on a particular occasion as built up from a way in which a
shape may be perceived plus some entirely generic constituent. The
link to a particular, given in perception or experience, must be
captured if we are to characterize fully the perceptual-demonstrative
concept used on this occasion.

It is important to distinguish between ways of being perceived and
ways of having a certain generic quality. A way in which a shape
property may be perceived is to be sharply distinguished from a way
of being shaped. A way of being shaped has to do with shapes
themselves (it is a way of occupying space) and does not have to do
with the way in which shapes are perceived. On the natural, unforced
way of understanding the notion of a way of being shaped, the
property of being a square and the property of being a regular
diamond are actually the same way of being shaped. This means that
one could not individuate perceptual concepts, whether fine grained
or rough grained, in terms of which way of being shaped they are
concepts of. If one uses ‘way’ in this territory for ‘way of being
shaped’, or ‘way of being colored’, or more generally ‘way of having
some generic kind of perceptible property or relation’, one will need
additionally something else to capture the different ways in which
these ways of being shaped may be perceived.

Although a way contributes to a correctness condition, it is
important that which object is presented in a given way is not simply a
matter of the object's fitting that way. A distant aircraft in the sky may
be presented as being in a certain direction. It may not in fact be in
that direction, because the light rays are passing through refracting
bodies of differentially heated air. When an object is presented in a
given way in perception, which object is presented in that way is at
least partly a matter of causation, as H. P. Grice⁷ argued. It is a
consequence of this point that one cannot regard ways as individuated
by the condition for them to refer to some object or property. It
is, however, a highly plausible thesis about each Fregean sense
(including perceptual demonstratives) that it can be individuated by the
condition for it to refer to a given object or property.⁸ If this plausible
thesis is correct, then we have a further argument that ways are not
Fregean senses.

This distinctness of ways from senses is further confirmed by the
natural explication of the conditions for a perceptual-demonstrative
concept to refer to a given object. Those conditions concern the
causal relations of an object to a way in which the object is given in
perception. Such conditions take for granted the notion of a way, and
then draw upon it in elucidating the condition for the demonstrative
to refer to a given object.


7 “The Causal Theory of Perception,” Proceedings of the Aristotelian
Society, Supplementary Volume xxxv (1961): 121-52.

8 See Michael Dummett, The Interpretation of Frege's Philosophy
(Cambridge: Harvard, 1981), chapter 3, pp. 45ff.

These remarks against treating fine-grained perceptual content as
explicable in terms of perceptual-demonstrative concepts should not
be taken to disparage the very idea of a fine-grained perceptual
demonstrative. There are perceptual-demonstrative ways of thinking
of presented shapes, colors, sounds, and the rest. Perception of an
object's shape may make available a way of thinking of that particular
shape, as ‘that shape’. This is a way of thinking that also makes it
possible to wonder or to speculate whether some other object has that
very shape, the shape one perceives the presented object to have.

It is constitutive of perceptual demonstratives that they can be used
by the thinker only while the perceptual experience that makes them
available continues. McDowell suggests that such a dependence on
the occurrence of the experience “would cast doubt on its being
recognizable as a conceptual capacity at all” (op. cit., p. 57). This may
be one of the reasons he moves to endorse recognitional concepts
as the correct treatment of perceptual content. Of
experience-dependent demonstratives, he says that they look “like
Wittgenstein’s case of the person who says ‘I know how tall I am’, putting
his hand on top of his head to prove it” (op. cit., p. 57).⁹ But the
person in Ludwig Wittgenstein’s example does not, by saying or
thinking ‘this tall’, come to know his height in meters or any
other unit, which is what knowing how tall one is requires on the
usual understanding. It is not even clear that such a person
thereby (I emphasize thereby) comes to know how tall he is, in the
sense that he could indicate which of the markings on a wall on the
opposite side of the room is roughly his height. By contrast, a
perceptual-demonstrative thought latches on to a magnitude, or
shade, or color, only if that magnitude, or shade, or color, is itself
given in the experience that makes the perceptual-demonstrative
concept available. There is no possibility of making
perceptual-demonstrative reference to a magnitude, or shade, or color, and
not knowing what magnitude, or shade, or color it is one is
thinking about. Such perceptual-demonstrative reference is the
most fundamental way of knowing what magnitude, or shade, or
color is in question.

We should also distinguish here the conditions for making
demonstrative reference in language from the conditions for employing a
perceptual-demonstrative concept. The person who utters ‘that
table’, pointing behind himself to a table he is not seeing, certainly
refers to a table, but he has no perceptual-demonstrative concept of
it. A similar point could be made for an utterance of ‘that shade’,
pointing to the wall behind oneself, without looking at it. There
are also important background conditions that must be met for
perceptual-demonstrative reference in thought to succeed.
Stroboscopic lighting, undetected movement, and much else can
undermine the holding of these background conditions.
Perceptual-demonstrative reference can fail even when it seems to the
thinker to succeed. (These background conditions will be all the more
important to one, like myself, who thinks that perceptual demonstratives
and observational concepts can be individuated in terms of the
conditions under which contents containing them can be not merely
rationally judged, but known.¹⁰) But there are plenty of cases in which
the required background conditions are fulfilled, and there can be
perceptual-demonstrative reference to finely-individuated properties
and relations. My point has not at all been that such perceptual
demonstratives are nonexistent. Rather, my point is that they are
themselves individuated in part in terms of ways in which properties,
magnitudes, and relations are given in experience, and so cannot be
used to elucidate the nature of such ways of being experienced.

9 His reference is to Wittgenstein’s Philosophical Investigations,
G. E. M. Anscombe, trans. (Oxford: Blackwell, 1958, second edition), § 279.

(b) Recognitional concepts. The other position tempting to some
conceptualists is that the fine-grained representational content of
experience is to be captured not by perceptual-demonstrative
concepts, but by fine-grained recognitional concepts. This is McDowell’s
view in Mind and World, and it squares with his most explicit recent
discussion of the issues.¹¹ McDowell writes in Mind and World: “Why
not say that one is equipped to embrace shades of colour within
one’s conceptual thinking with the very same determinateness with
which they are presented in one’s visual experience, so that one’s
concepts can capture colours no less sharply than one’s experience
presents them?” (op. cit., p. 56). Soon after in the same book he makes
clear: “What is in play here is a recognitional capacity” (op. cit., p.
57). The recognitional capacity associated with the recognitional
concept persists, if only briefly, after an experience which is of the
kind recognized (op. cit., p. 57). McDowell seeks to capture part of the
fine-grained spatial content of experience with a conceptual content
of the following sort, which he would say is represented as correct by
the experience:


This is shaped R.


Here R expresses, for McDowell, a recognitional concept of a way of
being shaped. Similarly for color, on McDowell’s view we have the
conceptual content


This is colored S


in the representational content of the experience, where S is a
recognitional concept of a fine-sliced shade.

10 See my Being Known (New York: Oxford, 1999), chapter 2.

11 “Replies to Commentators,” Philosophy and Phenomenological Research,
LVIII, 2 (June 1998): 403-31. In the part of these “Replies” which deals
with me, McDowell writes always of demonstratives, rather than
recognitional concepts. But it is important that for McDowell in his
Reply, “‘that way’ in ‘It looks that way’ is predicative, not
substantival. Expressions at this place in this construction express,
rather than refer to, ways things can look” (p. 418). On McDowell’s
theory, expressions in that place express recognitional concepts. This
is fully in accord with his position in Mind and World; see p. 57.

If it was wrong to have the general concept shape in the
demonstrative attempts to capture the fine-grained content of experience, it
seems equally wrong to have the general relational concept shaped
in the representational content under this conceptualist
treatment. The same applies to the concept colored. I shall
not, however, trade on this point, and shall take it that McDowell
could use precisely the points I made earlier about the absence of a
need for supplementation by a general concept. He could move
simply to propose the conceptual contents ‘This is R’, ‘This is S’ as
the fine-grained representational content of experience. The
recognitional capacities underlying these recognitional concepts persist,
“possibly for quite a short period,” according to McDowell in Mind
and World (op. cit., p. 172), where he holds that they can be employed
in the absence of perception of the finely-sliced properties to which
they refer.
As Diana Raffman¹² observes, it is well known that
perceptually-based discrimination of properties far outreaches memory and
identification of those properties. Recognitional capacities are by their
very nature constrained by memory capacities. I would add to
Raffman’s point only two observations. First, even if memory were as
finely discriminating as perception, that still would not make it right
to regard recognitional concepts as entering the representational
content of experience. There cannot be recognition when the
perceptual property is encountered for the first time in a given way. A
first-time experience of a property nevertheless still has a specific,
fine-grained representational content. And second, in those
counterfactual circumstances in which memory were as discriminating as
perception, the experience would still have a specific content which
could exist even if the recognitional capacity did not. The memory
structures required for the recognitional concept and its
corresponding recognitional capacity, and their subpersonal underpinnings,
could all be absent, and yet the experience still have a fine-grained
representational content. The specific fine-grained content would
show up in matching tests that do not need to rely on memory, as
opposed to current perception. This modal point applies however
short the period for which the recognitional capacity is said to exist.

12 “On the Persistence of Phenomenology,” in Thomas Metzinger, ed.,
Conscious Experience (Paderborn: Schoeningh, 1995), pp. 293-308, pp. 294ff.

It seems to me, then, that neither perceptual demonstratives nor
recognitional concepts can capture the representational content of
perception. But however convincing it may be, candidate concept by
candidate concept, that each candidate concept cannot capture
perceptual content, it is reasonable to ask here for more theoretical
explanation. Is there some deeper, and more general, reason why
perceptual content cannot be captured by any kind of conceptual
content? I shall return to this, when we are in a better position to
characterize more fully the nature of the distinction between
conceptual and nonconceptual content.


III. LEARNING OBSERVATIONAL CONCEPTS 

Now let us take the phenomenon of learning a new observational
concept from instances of it that are perceived when one is taught, or
otherwise acquires, the concept. Consider the representational
content of the subject's experience when she is learning the concept
pyramid, considered as an observational shape concept.

On the one hand, the representational content of this experience,
in the case of learning from a positive instance, must be sufficient for
someone rationally to apply the concept (must entitle her to apply
the concept) when experience is being taken at face value. If that
were not so, it would not be an observational concept after all. Yet on
the other hand, this representational content cannot include the
concept pyramid, for that would require the learner already to possess
the concept. The learner could have such experiences only if the
lesson were unnecessary.

The natural solution to this last quandary is to acknowledge that
there is such a thing as having an experience of something as being
pyramid shaped that does not involve already having the concept of
being pyramid shaped. What such an experience will have is a
nonconceptual content which, if correct, is sufficient for something’s
falling under the observational concept pyramid.

In fact, that is to understate the case. Not merely is the holding of
the nonconceptual content represented as correct in the positive
cases involved in learning sufficient for the object in question to be a
pyramid. The experiences from which, as presenting positive
instances, one can rationally learn the concept, are those which have
nonconceptual representational contents of the sort mentioned in
the very condition for possessing the observational concept that is
being learned. There is a specific kind of nonconceptual content of
experience to which one’s judgments ‘That’s a regular diamond’
must be rationally sensitive if one is to possess the observational
concept regular diamond. There is a different specific kind of
nonconceptual content of experience to which one’s judgment ‘That's a
square’ must be rationally sensitive if one is to possess the
observational concept square. There is a specific kind of nonconceptual
content of experience to which one must be sensitive if one is to
acquire a recognitional concept for the musical interval of an
augmented fourth. It is a different kind from that to which sensitivity is
required in possessing a recognitional concept for a diminished fifth,
and so forth.¹³

This rational sensitivity is not a matter of inference, even though
the occurrence of an experience with a certain nonconceptual
content does make rational a perceptual judgment with a conceptualized
content. It is intuitively wrong to classify the case as one of inference.
It also could not be correct, if the perceptual content is
nonconceptual: inferential relations can hold only between (states with)
conceptual contents. We can, as reflective thinkers, indeed introduce ways of
thinking about the nonconceptual ways in which things are given. But
this is built on a more primitive level of rational sensitivity to the
experience’s nonconceptual content itself. Such reflective thought is
not necessary for having the capacity to make perceptual judgments
about one’s environment.


IV. MAKING RATIONAL TRANSITIONS TO PERCEPTUAL JUDGMENTS

I have emphasized that the way in which some thing, or property, or
relation is given in the nonconceptual content of an experience is
something that contributes to what it is like to have that experience.
These ways which feature in nonconceptual content are then at the
conscious, personal level, and are not merely subpersonal. As features
of the subjective experience, their presence can entitle a thinker to
make a particular judgment, or to form a certain belief.


13 For some observational concepts, the kind may be amodal, and so
instantiated in experiences in more than one modality. This is plausible
for straight and right-angled, construed as observational concepts.

A thinker can be rational in making a transition from an
experience with a certain nonconceptual representational content to a
judgment with a certain conceptual content, in particular in making
a transition to judging a content in which an observational concept is
predicated of presented objects or events. Such a transition is rational
when the thinker is entitled to take her experience at face value, and
when the observational concept is individuated in part at least as one
that the thinker must be willing to judge when experience has a
certain kind of nonconceptual representational content (and is being
taken at face value). Consider Mach’s example of a closed
right-angled figure formed from four straight lines, and whose symmetry
about the bisectors of its opposite angles is perceived. Such an
experience can make rational the judgment ‘That's a regular diamond’.
It does not make rational the judgment ‘That's a square’. That latter
judgment is made rational rather by a different nonconceptual
content, in which the perceived symmetry of the closed figure is a
symmetry about the bisectors of its sides. Which symmetries are
perceived in each case does not need to be something known to, or
even conceptualized by, the perceiver.

The nonconceptual content in question throughout this paper is
nonconceptual representational content, with a correctness
condition. Consider any case in which an observational concept F is
plausibly individuated partly by its relations to a particular sort of
nonconceptual content. It is then a philosophical task to show that
the holding of the correctness condition for the nonconceptual
content in question ensures the holding of the correctness condition
for the conceptual content ‘That’s F’. In our simple spatial cases, one
could show this by showing that the shape which something must be
when it is presented (veridically) in an experience with a certain
nonconceptual content is a shape which is sufficient for the
observational concept is square to apply to it.

This is an instance of a task that exists for any rational transition
that involves a concept or concepts at some point or other. The task
exists equally for the case of logical transitions. If a premise gives a
good reason for accepting a certain conclusion, because of the logical
form of the inference, there must be some explanation, in the
semantics of the logical constants involved in the transition, of why
this is a valid transition. That is why we need a semantics, and not
merely a proof theory, for a logic. (Even in the special case in which
the semantics is itself proof theoretic, the need for a semantical
theory is still being granted.) The ordinary thinker does not need to
know what the explanation of the validity of a particular transition is,
in either the logical or the perceptual case. But the explanation must
exist if the transition is to be valid. Only so is there an explanation of
why it is truth preserving, and thereby of how it can lead to new
knowledge.

My own view is that it is always a task of philosophy to explain why
a transition is a good one. This requirement, in my view, applies to
the conceptual-to-conceptual case of logic, to the nonconceptual (but
still representational)-to-conceptual case of perceptual judgment,
and to the arguably wholly nonrepresentational-to-conceptual
transition from the occurrence of a pain in a subject to his conceptualized
self-ascription of pain.¹⁴

The general position on nonconceptual content that I have
outlined so far may seem intuitive, but it has been subjected to a
philosophical critique by McDowell. His most fundamental objection
is that experiences can have a rational bearing on thought only if
their content is conceptual: “The point of the claim that experience
involves conceptual capacities is that it enables us to credit
experiences with a rational bearing on empirical thinking” (op. cit., p. 52).
This claim needs an argument, and McDowell's argument for it in
Mind and World is that rational thinking involves the possibility of
scrutinizing the relations between experience and judgment.
Nothing is outside the limits of rational self-scrutiny (op. cit., p. 52), even
if, after scrutinizing the case in question, we decide to endorse it.
McDowell’s view is that it is hard to see how this scrutiny would be
possible unless experience has only conceptual content (op. cit., pp.
52-53).

That last step seems to me a non sequitur. I accept that we can
rationally scrutinize any instance of any putative reason-giving
relation, including the relation between experience and judgment.
McDowell’s concern is that on a theory like mine, the subject cannot
gain the required “comprehensive view of the two contents”, that of
the experience and that of the judgment (op. cit., p. 166). On my
treatment, however, a thinker can ask ‘Is something’s looking that
way a reason for judging that it’s square?’, for instance. On the
approach I advocate, ‘that way’, in this particular occurrence, refers
demonstratively to a way in which something can be perceived. The
reference itself is made by something conceptual: demonstrative
concepts can enter conceptual contents. There is no requirement that
the reference of the demonstrative be conceptualized (I shall return
to such uses of ‘that way’ in a few paragraphs). So thought can
scrutinize and evaluate the relations between nonconceptual and
conceptual contents, and obtain a comprehensive view of both. On
McDowell's theory, of course, the demonstrative ‘that way’ expresses
a sense (a recognitional concept), which does on his view enter the
representational content of experience. I have rejected that
treatment on other grounds: the required recognitional capacities do
not exist, and do not need to exist for the experience to have a
fine-grained content. But quite apart from that, the mere possibility
of the treatment I have offered shows that it is a non sequitur to move
from an agreed premise of the required scrutinizability in conceptual
thought of the rational relations between experience and judgment
to the conclusion that experience itself must have only conceptual
representational content. As long as we can think about the
nonconceptual representational content, as we certainly can, we can have the
required scrutinizability.

14 I hope this and the preceding paragraphs clarify my intentions at
around page 80 of A Study of Concepts, about which McDowell expresses
some puzzlement: “it is hard to see much in the way of a further issue
about how the reason [supplied by an experience with an allegedly
nonconceptual content] can be a good one” (“Replies,” p. 418).

McDowell also holds that there is a requirement of articulability of
reasons, including perceptual reasons, and that this too supports his
view that the representational content of experience must be wholly
conceptual: “[Peacocke] has to sever the tie between reasons for
which a subject thinks as she does and reasons she can give for
thinking that way. Reasons that the subject can give, in so far as they
are articulable, must be within the space of concepts” (op. cit., p. 165).
I am not at all sure myself that there is such a requirement of
articulability. But if there is, it seems to me that the use of the
demonstrative ‘that way’, construed in the way it is on my approach,
would allow the friend of nonconceptual content to meet any such
demand. ‘I believe it’s square because it looks that way’ is an
articulation of the subject's reasons. Once again, the conceptual character
of the conceptual constituent ‘that way’ must be sharply distinguished
from the nonconceptual character of its reference, a nonconceptual
way in which something is perceived.

I would invoke demonstrative ways of thinking of ways in
addressing a closely related objection of Brewer’s (op. cit., pp. 166-69). He
argues that a thinker must be capable of recognizing her reasons as
such, that is, as reasons. This demand does not, Brewer argues,
intrinsically raise difficulties of principle in the case of a mental state
that gives reasons, provided that the state has conceptual content. But
in the case of a mental state with an allegedly nonconceptual content,
“mere possession of the state involved in having such a ‘reason’ is
explicitly insufficient for the subject's understanding of the
proposition whose association with this state grounds its putative status as his
reason for doing what he does. The only alternative seems to be that
he must have some second-order knowledge of the relation between
mental states of the type in question and the truth of the belief for
which he thereby, and only instrumentally, recognizes his having a
reason” (op. cit., pp. 167-68). Brewer says that the nonconceptualist is
“condemned to follow the discredited classical foundationalists and
coherentists” (op. cit., p. 166). In short, his position is that the
nonconceptualist can provide for the recognition of his own reasons
only by arguing as follows:


(i) This state is F.
(ii) Anything that is F is a reason for believing that p.
(iii) Hence I have reason to believe that p (op. cit., p. 168).


Brewer adds that “what stands in place of ‘F’ will have, at best, to
be hideously complex” (op. cit., p. 168).

I agree with Brewer that, if the nonconceptualist could account for
the recognition of the reason that his experience gives him for
making a perceptual judgment only in this second-order,
instrumental fashion, the position would be unacceptable. But I maintain that
the nonconceptualist’s position does not have the consequence
Brewer claims. Here is a way that a thinker can come to recognize a
state with nonconceptual representational content as giving reasons,
without taking the second-order, instrumentalist route. The
perceptual experience represents some presented object or event as having
some property, given in a certain way W. So the experience has a
content (if we put it, conveniently but inessentially for the present
point, in object-dependent form):


(iv) x, given in way s, has property P, given in way W.


The way s in which the object is given will capture its apparent
distance and direction, egocentrically characterized, from the
perceiver.¹⁵ Now suppose this way W is, for example, one of the
nonconceptual ways in which a shape can be given, and is mentioned as
sufficient for being square in the possession condition for the
concept square. On my account, the subject is then entitled to move from
(iv) to the conceptualized content


(v) That object [given in way s] is square.


15 No doubt the way W could also be said in some pre-theoretic sense to
be part of the way the object is given in perception. But it is important
to distinguish between those ways which help to determine which object
is perceived from those which do not. s is in the former class, though
as emphasized earlier, this determination is not a matter of fitting.




Similarly for any other way whose presence in an experience gives
good reason for applying an observational concept.

Now, what would be involved in a subject’s reflecting on these
transitions, and in coming to appreciate his reasons as reasons? (iv)
and (v) do not seem to be hideously complex; complexity is not the
heart of the issue. But there are two interesting capacities a thinker
must have if she is to be capable of appreciating both the occurrence,
and the rationality, of her transition from (iv) to (v).

First, our reflective subject must have a special way of thinking of
the way W. There seems to be a phenomenon here that is a partial
analogue, for the relations between nonconceptual contents and
demonstrative concepts of them, of the relations that hold between
concepts and canonical concepts of those concepts (“conceptual
redeployment,” as I called it).¹⁶ When some property is given to a
thinker in perception in way W, then, if that thinker is sufficiently
reflective, there seems to be a conceptual way C_W of thinking of that
way W, where this conceptual way C_W seems to be made available to
the thinker precisely by her enjoying an experience whose
nonconceptual content includes that way W. Presumably, this relation of
‘being made available by’ consists at least in part in a certain
canonical sensitivity of judgments involving the concept C_W to evidence
about the way W which the subject has precisely because she is
experiencing some property as given in way W.

Second, in appreciating that the occurrence of an experience with
the representational content (iv) entitles her to judge the content
(v), our thinker will be displaying a sensitivity to a particular
boundary. This is the boundary between those states which are, and those
states which are not, mentioned in the possession condition for a
given concept such as square. Our thinker will have moved from the
ground level of satisfying a possession condition to the level of
thinking about it. One, perhaps the basic, way to make this transition
is to ask questions that are in the first instance not about concepts,
but about the world. Such a question might be ‘Would something’s
looking that way W give reason to think it is square?’ One can answer
such questions by drawing on one’s ordinary, ground-level abilities to
react rationally to one’s perceptual states in coming to make
judgments about the world. Someone who satisfies a possession condition
can clamber her way up to reach some understanding of what that
possession condition is.


16 See my Being Known, sections 5.5-5.6.

This description of the transition from perception to judgment,
and of the subject's appreciation thereof, is not second order. Nor is
it of the highly problematic kinds in the sorts of classical
foundationalist and coherentist writings Brewer rightly criticizes. (iv) and (v) are
not themselves about mental states at all. It is true that (iv) becomes
pertinent because of its relation to the content of a mental state. But
that will equally hold on a conceptualist's account of the reason
relation and of appreciation of it as a relation of reason giving. The
content is conceptual, according to that theorist, and stands in
various reason-giving relations appreciable by the thinker. But that
content is relevant only because it is the representational content, on the
conceptualist’s view, of the perceptual experience in question. That it
is relevant only because of that relation does not make the
conceptualist’s view second order, or instrumentalist. Equally the
corresponding feature of (iv) and (v) cannot make the nonconceptualist’s
account second order or instrumentalist either. It is the relation
between (iv) and (v) which is crucial in the account of the
reason-giving relation, and appreciation of it as such, and that does not
involve anything second order.

A full theoretical, philosophical account of why the transition from
(iv) to (v) is valid would of course involve some philosophical theory.
The theory would be part of a general theory of content. A full
theoretical account would equally be required in the case of logical
transitions between purely conceptual contents. The philosophical
theory does not need to be known to the ordinary thinker in either
of these rather different cases. In both cases, there is a willingness to
make transitions between certain contents, which is constitutive of
possession of the observational concept or the logical concept,
respectively. On my account, in the former case the content of the
entitling or reason-giving state is nonconceptual, and in the latter it
is conceptual. The ordinary thinker can appreciate his reasons as
reasons in either case. In either case, he may or may not go further
in reflecting on the nature of the good reasons involved.

A different question about this approach does not query the dis-
tinctions I have drawn, but raises at least a problem about how the
issues have been framed.17 Surely, I would not object to a regimen-
tation of the content of a conceptual state in which, for instance, a
recognitional concept for a particular size of diamond shape were
captured by the very nonconceptual content mentioned in its posses-
sion condition? But if we can use nonconceptual content in capturing


17 Here I am indebted to Stephen Schiffer.
conceptual content, has the issue with the conceptualist been prop-
erly joined when we take it as the question of whether experience has
only conceptual content? I agree that we could employ such a means
of capturing a recognitional concept. But if we do use the apparatus
of nonconceptual content in characterizing conceptual states, we
must distinguish sharply between the kinds of relations in which a
state must stand in order to possess the original nonconceptual
contents, and those in which it must stand in order to possess those
conceptual contents which are, under this proposal, captured by
nonconceptual contents. The arguments of the next section will
further distinguish these two sets of relations.


V. KINDS OF CONTENT, ANIMAL PERCEPTION, AND OBJECTIVITY
Cats, dogs, and animals of many other species, as well as human
infants, perceive the world, even though their conceptual repertoire
is limited, and perhaps even nonexistent. These perceptions are
subserved by perceptual organs, and in the case of higher species,
subserved by brain structures similar in significant respects to
those which subserve mature human perception. By the “hard
line” on animal perception, I mean the thesis that none of the
conscious perceptual states with representational content enjoyed
by mature humans can be enjoyed by nonlinguistic animals with-
out concepts, or with only minimal conceptual capacities. By the
“soft line,” I mean simply the denial of the hard line. So the soft
line says that some of the conscious perceptual states with repre-
sentational content enjoyed by mature humans can be enjoyed by
nonlinguistic animals without concepts, or with only minimal con-
ceptual capacities.

For what it is worth, pretheoretical intuition seems to find the hard
line too hard to accept. The hard line entails that the following
cannot be literally true: that the animal has a visual experience as of
a surface at a certain orientation, and at a certain distance and
direction from itself, in exactly the same sense in which an adult
human can have a visual experience with that as part of its content.
Abandoning all pretense at unbiased terminology, I shall call the
conclusion that that cannot be literally true the unintuitive conclusion.
The soft line, which prima facie at least is not committed to the
unintuitive conclusion, is naturally developed hand-in-hand with a
theory of nonconceptual representational content. According to the
soft line, it is nonconceptual representational content which can be
common to visual experiences which both you and a mere animal
may enjoy.
Those who take the hard line need not deny that animals have
some perceptual sensitivity to their environment, a sensitivity which
explains their actions. McDowell, who takes the hard line, insists that
animals have such a perceptual sensitivity: “It is a plain fact that we
share perception with mere animals” (op. cit., p. 114).18 Nor does
McDowell have in mind, for instance, only some kind of informa-
tional sensitivity that would make animals automatons of the sort
which René Descartes apparently believed them to be. In Mind and
World, McDowell speaks of “proto-subjectivity” (op. cit., p. 117). He
says that there is a legitimate kind of talk of what the features of the
environment are for an animal, which “expresses an analogue to the
notion of subjectivity, close enough to ensure that there is no Carte-
sian automatism in our picture” (op. cit., p. 116). All the same, it must
be only an analogue on McDowell’s view. It is, on his account, not
literally true that the mere animal has a visual experience as of a
surface at a certain distance and direction in exactly the same sense
that mature concept-using humans do.

It is certainly a necessary condition of being reasonable in rejecting
the unintuitive conclusion that one address the arguments that in
perceptual experience, there is only conceptual, and not nonconcep-
tual, content. But if we can develop a theory of nonconceptual
representational content, and answer the conceptualist’s objections
to it, as I have been trying to do, then much of the pressure to adopt
the unintuitive conclusion is substantially relieved.

Although I think, however, that the unintuitive conclusion is in-
deed to be rejected and the soft line is right, the truth in this area
seems to me much more interesting than those somewhat flat con-
clusions suggest. It seems to me that there is a good, Kantian point in
McDowell which should not get lost in the endorsement of the soft
line. There is something very plausible in McDowell's Kantian posi-
tion that “the objective world is present only to a self-conscious
subject, a subject who can ascribe experiences to herself. It is the
power of conceptual thinking that brings both the world and the
self into view. Creatures without conceptual capacities lack self-
consciousness and—this is part of the same package—experience of
objective reality” (op. cit., p. 114). Although of course a creature with
perceptions with nonconceptual representational contents has states
whose correctness conditions concern the objective world, it is a
question whether a creature without the specific conceptual appara-
tus McDowell mentions would be conceiving—would even have the


18 See also pp. 50, 64-65, and Lecture VI, sections 4-7.
resources for conceiving—this world as objective. So the question
arises: Is there some way we can consistently acknowledge this point
while also maintaining the soft line? I shall be arguing that there
is—but to do so we need some deeper theory of the nature of the
distinction between conceptual and nonconceptual content.

I first draw a distinction within the possible grounds, or reason-
giving states, which lead, in a given subject’s psychological economy,
to acceptance of a given kind of content. We can distinguish those
grounds being exhaustive, for a given content, and their being canon-
ical. Consider the minimal possible ways a creature capable of per-
ceptual states with nonconceptual content may come to accept of
three perceived objects a, b, and c that they are in a straight line, or
may come to reject that same content. The content may be accepted
on the basis of a perception with a nonconceptual representational
content to that effect. Such a creature may also store this information,
with or without memory images of the original perception. The
content may come to be rejected if, in moving or being moved
around, the subject does not have perceptions of the sort which, from
other angles, should be produced by a, b, and c if they are in a straight
line. Now suppose such perceptual and cognitive-map based ways of
coming to accept or reject exhaust the subject’s ways of coming to
accept or reject such contents (or at least, exhaust those which are
distinctive of those contents, if this subject is capable of engaging in
more general modes of reasoning). Such exhaustiveness of the per-
ceptual-cum-cognitive-map based ways of coming to accept or reject
those contents contrasts sharply with the case of possession of the
concept of being in a straight line. For a thinker with the concept
collinear, any grounds for thinking that three things stand in the same
relation as is perceptibly instantiated when the thinker sees three
things to be collinear are equally grounds for judging those first three
things to be collinear. There are no restrictions on what those kinds
of grounds may be. What those grounds may be is something to be
discovered. The grounds are in no way restricted to what can be
attained through perception and the use of a cognitive map. These
additional grounds need not be anything particularly sophisticated.
Even the child who moves some straight-edged physical object to
check whether three objects are in a straight line, when he cannot tell
this, for one reason or another, just by looking, is manifesting a
sensitivity to this open-endedness of grounds.

The presence in the thinker's thought of possible additional
grounds is entirely consistent with perception having some canonical
status in the individuation of the concept collinear, considered as an
observational concept. Some grounds can have a privileged status,
consistently with there being no restrictions in principle in a thinker’s
thought on the range of possible grounds. The fact that certain
grounds are canonical does not mean that they are exhaustive. We
must distinguish the exhaustive from the (merely) canonical.

It is important to notice two points about the distinction between
the exhaustive and the canonical. First, when a certain body of
grounds is not exhaustive in principle in a thinker’s thought, the
thinker is tacitly operating with the notion of the same property, or
same relation, in the world. It is the same relation that can be
detected, on the one hand, by perception with the use of cognitive
maps and, on the other, by the additional means—the ruler, or
whatever else the ingenious thinker may come up with. The thinker
for whom grounds involving perception-cum-cognitive maps are not
exhaustive for acceptance or rejection of a given kind of content must
be capable of kinds of reasoning that involve grasp of the notion of
sameness of property and sameness of relation. The same holds for
the case of sameness of objects, if we apply these points to the
distinction between genuine concepts of objects, and the ways objects
are given in the nonconceptual content of perception.

The second point to notice about this distinction is that it has to do
with the nature of objectivity. It is a consequence of the idea of things
being objectively thus-and-so, of its being objectively the case that
these objects stand in this relation, that their doing so is something
that can be known about in many different ways. It is not restricted in
principle to any one mode of access. We do not have fully objective
thought unless a thinker manifests some grasp of a conception of
objectivity with this character.

It is here that my discussion connects up somewhat more closely
with McDowell, despite our major differences. For he insists that only
with conceptual contents does one have a certain kind of objectivity—
and I have recently been asserting just that. The reasons are not, of
course, precisely the same. I have not, for instance, committed myself
to the view that there is properly objective thought, thought about
something conceived as objective, only if the subject actually has the
concept of perceptual experience. All the same, I am in agreement
with McDowell that the minimal requirements for thought about an
objective world that is in some way conceived as objective are not met
simply by the possession of perceptual states with nonconceptual
representational content.

A similar point holds also for a subject’s conception of himself,
herself (or perhaps, in the case of some mere animals, itself). The
reasoning is essentially the same: it relies again on the distinction
between the exhaustive and the canonical. Suppose a thinker’s only
way of establishing contents about himself is by the use of the follow-
ing:


(a) By perception—‘I am in front of a lake’, ‘I am under a tree’.

(b) By memory—‘I was in front of a lake’, ‘I was under a tree’.

(c) By integrating the deliverances of perception into a cognitive map
and extracting the consequences thereof—‘There’s a wood behind
me’, ‘There’s a lake an afternoon’s walk from the wood in that
direction’.


That is, for this thinker perception, memory, and the operation of
his cognitive map in the ways illustrated above exhaust his ways of
establishing things about himself.19 This is not yet to have a full
conception of himself, as an object in the objective order of things. In
particular, it involves no conception of what it would be for some-
thing to be true of himself that is not establishable as true by the
above methods. Contents not so establishable include ‘When I was
asleep, I rolled over four times’, ‘There are times I existed of which
I know nothing’. Full objective thought about oneself must, as in the
case of conceptual thought about properties and relations, involve
some conception of oneself as an object truths about which are not
exhausted by those which can be established in a certain restricted set
of ways. The subject who is oneself may exist in times, places, and
circumstances to which one's current means of establishing first-
person contents cannot reach.

I conjecture that this distinction, in point of grasp of objectivity,
between the minimal requirements for having states with nonconcep-
tual contents, and what is involved in conceptual content, is a deeper
reason why perceptual content cannot be explained in terms of
conceptual content. The most primitive aspects of representational
content in perception, which our subjective experience shares with
the mere animals, do not involve the grasp of objectivity required for
conceptual content. This is one of the reasons that trying to treat all
perceptual content as conceptual involves an overascription. We
should always distinguish between content that is objective, and
content which is not only objective, but which is also conceived of as
objective.

CHRISTOPHER PEACOCKE 
New York University 


19 On nonconceptual self-consciousness, see Hurley, chapter 4.
New Horizons in the Study of Language and Mind. NOAM CHOMSKY, with
a Foreword by NEIL SMITH. New York: Cambridge University Press,
2000. xvi + 230 p. Cloth $54.95, paper $19.95.


Here are seven essays that describe and deplore a philosophical
double standard that respects the methods and results of physics,
chemistry, and biology but not the methods and results of linguistics
and other sciences of the mind. One sign of the double standard is
that, while hardly anyone thinks one can do philosophy of physics
without knowing physics, it is all too common to think one can do
philosophy of language without knowing linguistics.

Noam Chomsky is, of course, the leading figure in contemporary
linguistics. Starting in the 1950s, his development of generative gram-
mar was an important factor in the shift from behavioristic to cogni-
tive approaches to language and mind. Chomsky’s approach takes
the goal of linguistics to be to characterize the human faculty of
language, noting its differences from the human faculties for general
problem solving and science. As Chomsky and other linguists tried to
give explicit characterizations of the competence of a speaker of a
language like English, it became clear that a child learning a lan-
guage simply does not have the sort of evidence available that would
enable it to learn the relevant principles from scratch. There is a
“poverty of the stimulus.” The child must be prepared to acquire a
language with these principles in a way that it is not prepared to
acquire the principles of, say, physics or quantification theory.

It is clear that normal children acquire a language that reflects
their particular linguistic environment. A child brought up in Japan
acquires a version of Japanese. The same child brought up in Brazil
acquires a version of Portuguese. So, these languages must in some
sense reflect some of the same underlying innate principles.

Further reflection along these lines and a great deal of empirical
study of particular languages has led to the “principles and parame-
ters” framework which has dominated linguistics in the last few
decades. The idea is that languages are basically the same in struc-
ture, up to certain parameters—for example, whether the head of a
phrase goes at the beginning of a phrase or at the end. Children do
not have to learn the basic principles, they only need to set the
parameters. Linguistics aims at stating the basic principles and pa-
rameters by considering how languages differ in certain more or less
subtle respects. The result of this approach has been a truly amazing
outpouring of discoveries about how languages are the same yet
different.

More recently, there have been attempts to try to explain some of
the basic principles on the assumption that the language faculty is
close to an ideal engineering solution to a problem of connecting the
language faculty with the cognitive system and the articulatory per-
ceptual system. This “minimalist program” remains highly specula-
tive, but whether or not it succeeds, contemporary linguistics as a
whole has been a tremendous success story, the most successful of the
cognitive sciences.


I. SCIENTIFIC VERSUS ORDINARY NOTIONS

One would therefore expect that any philosopher of mind or lan-
guage would make it his or her business to understand the basic
methodology and some of the results of this subject. But many
philosophers of mind and language proceed in utter ignorance of the
subject.

For example, it is or ought to be well known that linguists do not
theorize using the ordinary notion of language in which German and
Dutch are two different languages and Chinese is a single language.
Language in this ordinary sense is a political or social notion. Speak-
ers on both sides of the Dutch-German border understand each other
quite well, although some are counted as speakers of German and
some as speakers of Dutch, and both have trouble understanding
other speakers of German. There are great differences between
Chinese speakers, yet all are counted as speaking the same language,
Chinese, whereas people in France and Portugal are not counted as
speaking the same language, although their differences are much less
than differences among Chinese. The social and political aspect of
this conception of language is brought out in the quip that a lan-
guage is a dialect with an army and a navy. So, some theorists at least
take the unit of language to be a dialect or even an individual
“idiolect”—the very particular dialect of a particular person, with no
assumption that any two people have exactly the same idiolect. In
fact, none of these notions—language, dialect, idiolect—plays any
role in contemporary linguistic theory, which is concerned with
characterizing the internal linguistic faculty.

Nevertheless, many philosophers persist in thinking that linguistics
is or ought to be the study of language in this ordinary sense. They
argue (or, rather, assert) that it is theoretically important to try to
specify what it is to have a more or less complete grasp of a language
in that sense and they suppose that people with an incomplete or
erroneous understanding of aspects of a language do or should defer
to others who have a more complete understanding. These philoso-
phers believe furthermore that principles of language have to do with
social rules or conventions among people who speak the same lan-
guage.

This is true despite the fact that the principles of language of
interest to contemporary linguistics are never explained as due to
linguistic convention or social practice. Contemporary linguistics
supposes that the relevant principles are built into the language
faculty and are therefore the same in all languages, up to parametric
variation. Language use is not a matter of rule following in any
intelligible sense.

It is true that a speaker may defer to one or another elite with
respect to pronunciation or conditions of application for terms just as
a person may defer to another in all sorts of other ways. The notion
of “misuse” of language is relevant to “the study of the sociology of
group identification, authority structure, and the like” (71), but it is
not taken seriously in linguistics.


II. MIND-BODY PROBLEM

Chomsky raises other related issues in these essays. For example, he
argues that it is a confusion to suppose that there is a “mind-body
problem.” Before Newtonian physics, the mind-body problem was the
problem of giving a mechanical explanation of mind, the presuppo-
sition being that everything else could be given a mechanical expla-
nation. With the failure of that presupposition, the issue is unclear.

Just as philosophers of language often fail to distinguish between
ordinary notions and scientific notions of language, so philosophers
of mind often fail to distinguish between ordinary notions like belief,
sensation, and desire, on the one hand, and notions that appear in
scientific theories. Chomsky observes that there is no more reason to
think that notions of desire or belief will play a role in scientific
psychology than to think that the ordinary notion of language will
play a role in linguistics or that ordinary notions like desk and chair
will play a role in physics. (This does not mean there are not desks,
chairs, languages, desires, or beliefs. Only that these notions are not
suitable for scientific purposes.) Similarly, there is no reason to
assume, as many philosophers do, that mental “representations” ap-
pealed to in psychological or linguistic theories must represent things
in the world.

Contemporary philosophy of language is sometimes concerned
with alleged relations between expressions and things, denotation,
reference, where there is a certain amount of appeal to “intuitions.”
Chomsky observes that we cannot have intuitions about these things
deriving from our language faculty any more than we can have such
intuitions about angular momentum.

Furthermore, it is important to distinguish ethnoscience, which
might be concerned with how people normally understand things,
from physics or psychology or linguistics, which is concerned with
what is actually the case. Chomsky argues that philosophers are often
best understood as doing ethnoscience, although without adhering to
normal standards of empirical inquiry. In any event, the study of the
semantic resources of the language faculty is to be distinguished from
the study of ordinary conceptions of meaning.


III. WORDS

Another theme in these essays has to do with the semantic represen-
tation of words in a natural language. Chomsky argues that the
underlying semantic representation of most words is quite compli-
cated, often involving intricate and highly specialized perspectives
involving human interests and concerns, providing various analytic
connections in ways that could not be learned from scratch, so there
is a “poverty of stimulus” argument here.

Here are some examples. If I say that I painted my house brown,
you will understand me to mean I painted it brown on the outside.
That is the default or unmarked case. But I can also say that I painted
my house brown on the inside. To climb a mountain is to go up, but
you can also climb down the mountain. If I am inside my house I
cannot see it (except perhaps through a window if an exterior surface
is then visible). I am not near my house if I am inside.

Words offer conflicting perspectives. The book I just took out of
the library has a red cover, took four years to write, weighs two
pounds, and has been translated into several languages. I can paint
the door brown and then walk through it. After the bank lowered
interest rates, it burned down and was rebuilt across the street. The
pronoun in this case can refer to a financial institution but cannot be
used to refer back to a river bank, as in, ‘After the bank lowered
interest rates, there was a flood and the river overflowed it’. This may
relate to a worry some philosophers have about the idea that to know
something you must believe it and it must be true. What is known is
a fact, what is believed is a proposition. Some philosophers therefore
object that what you know cannot be what you believe. But perhaps
that is like objecting that the baby cannot both finish the bottle and
then break it.

In this brief review I have been able to mention only a small part of 
the riches contained and I have not really been able adequately to 
defend Chomsky’s complaints against philosophers of language and
philosophers of mind, although I am convinced that he is right. In
any event, this collection should be read and studied carefully by all
philosophers of mind and language.

GILBERT HARMAN 
Princeton University 


Collected Papers. JOHN RAWLS. Cambridge: Harvard University Press,
1999. xxi + 656 p. Cloth $39.95.


Long before the publication of A Theory of Justice,1 John Rawls deeply
influenced moral and political philosophy through a series of articles
that remain classics. Again in the years before the publication of
Political Liberalism,2 Rawls published important papers that foreshad-
owed the shift in perspective reflected in that second book. His more
recent papers develop the idea of public reason central in Political
Liberalism and lay the foundation for his The Law of Peoples.3 Rawls had
been reluctant to publish his collected papers, apparently because he
saw them as “experimental,” putting forth ideas to be reworked after
further reflection and discussion. Until now, the papers have been
scattered about in various journals and books, some of which are
difficult to find. Now these significant papers have been collected in
one volume, well edited by Samuel Freeman.

Collected Papers brings together nearly all of Rawls's published pa-
pers. These reflect the development of Rawls's thought from twenty
years before A Theory of Justice, through his transition to Political
Liberalism, and beyond. The collection includes twenty-five papers
plus the Preface for the French Edition of A Theory of Justice, and an
interview published in Commonweal. Freeman has provided a brief
informative Preface and a thorough analytical index similar in style
and scope to the indices that Rawls provided for A Theory of Justice and
Political Liberalism.

The essays in Collected Papers fall into several groups. The first ten
essays were written from 1951 to 1971, the period in which Rawls was
constructing A Theory of Justice. The next five, written in the 1970s, are


1 Cambridge: Harvard, 1971.
2 New York: Columbia, 1993.
3 Cambridge: Harvard, 1999.
primarily extensions of A Theory of Justice, explaining some of its ideas
more fully and responding to various objections. Next, six papers
(and the Preface to the French Edition of A Theory of Justice) represent
Rawls’s gradual transition to his political conception of justice as
fairness, which culminates in Political Liberalism. The last four essays in
Collected Papers take us beyond Political Liberalism in several ways. “The
Idea of Public Reason Revisited” introduces modifications to be
included in a later edition of Political Liberalism. “Themes in Kant’s
Moral Philosophy” is a sample of Rawls’s influential work on the
history of ethics, more of which is now available in a volume edited by
Barbara Herman, Lectures on the History of Moral Philosophy.4 The two
remaining papers, “The Law of Peoples” and “Fifty Years After Hiro-
shima,” are the basis for Rawls’s somewhat fuller discussion of inter-
national justice, The Law of Peoples.

A few of Rawls's previous publications are not contained in Collected
Papers. Some of these were omitted because they were early versions
of papers later published in a revised and expanded form. This was
the reason for excluding the initial versions of “Justice as Fairness”
and “The Basic Structure as Subject.”5 Three other essays were not
reprinted in Collected Papers because they are all republished in the
paperback edition of Political Liberalism. These are “Reply to Haber-
mas,” “The Basic Liberties and Their Priority,” and a later version of
“The Basic Structure as Subject.”6 Finally, Collected Papers does not
include a few short pieces written for special occasions. These are
“Review of Toulmin,” “Reply to Lyons and Teitelman,” and “Roderick
Firth: His Life and Work.”7

Collected Papers should be welcomed by different sorts of readers for
a variety of reasons. First, they provide a convenient packaging of
essays with which, in addition to Rawls’s books, any professional
moral and political philosopher should be acquainted. Early papers,
such as “Two Concepts of Rules” and “Outline of a Decision Proce-
dure for Ethics,” make significant proposals beyond what is devel-
oped in A Theory of Justice. It also may be instructive to see why Rawls


4 Cambridge: Harvard, 2000.

5 Respectively, in Philosophical Review, LXVII, 2 (April 1958): 164-94, and in Alvin
I. Goldman and Jaegwon Kim, eds., Values and Morals: Essays in Honor of William
Frankena, Charles Stevenson, and Richard Brandt (Boston: Reidel, 1978), pp. 47-71.

6 Respectively, in this JOURNAL, XCII, 3 (March 1995): 132-80; in Sterling Mc-
Murrin, ed., The Tanner Lectures on Human Values (Salt Lake City: Utah UP, 1983),
pp. 3-87; and in American Philosophical Quarterly, XIV (1977): 159-65.

7 Respectively, in Philosophical Review, LX, 4 (October 1951): 572-80; in this JOUR-
NAL, LXIX, 18 (October 5, 1972): 556-57; and in Philosophy and Phenomenological
Research, LI, 1 (March 1991): 109-18.
abandoned ideas in earlier papers: for example, withdrawing a major
argument about “real risks” in “Constitutional Liberty and the Con-
cept of Justice” in response to an objection by Gilbert Harman. It
remains an important, and much debated, question why Rawls devel-
oped a modified position in Political Liberalism and whether the
changes were, as he thought, necessary improvements. The interven-
ing essays help to clarify this. Some essays are important supplemen-
tary clarifications of points in the books, and the later essays lay out
Rawls's interpretation of Kant's ethics and his treatment of interna-
tional justice.

Second, Collected Papers should interest students, teachers, and
intelligent general readers who want to understand some main
themes but are not prepared to read all of Rawls's work. The set of
papers includes some quite readable and challenging essays that are,
compared to the books and other papers, not so heavily laden with
detailed argument, replies to objections, and background presuppo-
sitions. “Justice as Fairness” and “Justice as Reciprocity” may be the
best examples of this. Having the whole set together, of course,
encourages further exploration.

Third, the volume is a valuable resource for scholars and students
in any discipline that raises questions about the many topics that
Rawls addresses. Thanks to the excellent index to Collected Papers, one
can easily find where Rawls comments on various philosophical con-
cepts, social issues, and other theories and authors, and his comments
are invariably worth consulting.

Very briefly, some main lines of the moral and political philosophy
expressed in Collected Papers as well as Rawls’s previous books are
these. The work starts from a conviction that we cannot satisfactorily
address particular questions of justice without first determining the
demands of justice on the basic structure of society. The initial step
toward evaluating traditional principles for this purpose was to
construct the idea of an “original position,” which is arguably the
appropriate standpoint for choosing among them. On this basis, Rawls
then argues that two principles of justice he articulates are preferable
to other traditional principles, such as utilitarianism and
perfectionism. Rawls’s principles insist on the priority of constitutional liberties,
a strong requirement of “fair equality of opportunity,” and a
distributive principle that opposes inequalities in the basic structure that are
not to the benefit of the least well off. Rawls attempts to confirm his
results by applying the principles to show that they tend to match our
“considered judgments” regarding particular matters. Then he
considers how a “sense of justice” might develop to make a just society
stable. Early papers, like his first book A Theory of Justice, do not
sharply separate moral and political issues, and they suggest
aspirations for developing a comprehensive, true moral theory. Later
papers, like the later book Political Liberalism, consider the basic theory
(“justice as fairness”) only as a “political conception” on which,
arguably, it is possible to develop an “overlapping consensus” among
those who believe in diverse, but “reasonable,” comprehensive moral
and religious doctrines. Here, justification is sought by emphasizing
potentially shared political values compatible with a wide diversity of
views about morality and religion. Thus the principles of justice as
fairness are presented without insisting on them as “true” or
justifiable regardless of historical conditions. Rawls’s papers as well as his
books reflect his influential views on how moral and political theories
can be constructed and justified. We are to aim, not for deductive
proofs from self-evident premises, but rather for a “reflective
equilibrium” in our thinking after we have surveyed all aspects of a theory
and its alternatives in the light of whatever relevant facts and
arguments we can think of.

Rawls’s special vocabulary, with terms such as “reflective
equilibrium,” “original position,” “primary goods,” “the difference
principle,” and the like, has become part of the common currency of
philosophical discourse. He has been often, and rightly, honored for
his innovative, thorough, systematic, and deeply thoughtful
treatment of the central problems of social justice. His work is also valued
for the way it reconstructed and revitalized ideas prominent in the
social-contract tradition and Kantian moral philosophy. His ideas
have been controversial, but virtually all critics readily acknowledge
the value of seriously engaging with his thought. But what has
inspired generations of Rawls’s students, more than any particular set of
ideas, has been Rawls’s example as a teacher, philosopher, and
human being. The qualities they admire will be evident, I expect, to
any reader of Collected Papers. For example, although the papers focus
on abstract principles and arguments in a characteristically
impersonal style, the reader can hardly fail to notice the author’s humanity,
fairness, intellectual integrity, and deep commitment to the pursuit
of reasonable responses to social injustice.
THOMAS E. HILL, JR.
University of North Carolina/Chapel Hill

THE INTRANSITIVITY OF CAUSATION REVEALED
IN EQUATIONS AND GRAPHS*


We live in exciting times. By ‘we’ I mean philosophers studying
the nature of causation. The past decade or so has
witnessed a flurry of philosophical activity aimed at cracking
this nut, and, surprisingly, real progress has been made. Two
developments are especially worthy of note.¹ First, there has been a
resurgence of interest in the counterfactual theory of causation, given
its best-known formulation by David Lewis.² Second, there has been
increasing philosophical interest in the techniques of causal
modeling developed and employed within fields such as econometrics,
epidemiology, and artificial intelligence.³ These two developments
have been largely independent and addressed to different sorts of
problems. Work in the counterfactual tradition has been primarily
concerned with issues involving “token” or “singular” causation, while
work in the second tradition has tended to focus on issues concerning
“type-level” or “general” causation.


* For discussion and comments, thanks go to Martin Barrett, Paul Bartha, Ellery
Eells, Clark Glymour, Alan Hajek, Dan Hausman, Mark Kalderon, Henry Kyburg,
Michael McDermott, Laurie Paul, Augustin Rayo, Jonathan Schaffer, Elliott Sober,
Jim Woodward, Stephen Yablo, and especially Judea Pearl.

1 I do not mean to disparage other important developments in the theory of
causation, such as the development of a theory of causal processes in terms of
conserved quantities; see Phil Dowe’s Physical Causation (New York: Cambridge,
2000). But I shall not talk about them here.

2 “Causation,” this JOURNAL, LXX, 17 (October 11, 1973): 556-67; reprinted in
Philosophical Papers, Volume II (New York: Oxford, 1986), pp. 159-72. For examples of
important recent work on counterfactual theories of causation, see the essays in this
JOURNAL, XCVII, 4 (April 2000), and in John Collins, Ned Hall, and L. A. Paul, eds.,
Causation and Counterfactuals (Cambridge: MIT, forthcoming).

3 See, for example, the essays in Vaughan R. McKim and Stephen P. Turner, eds.,
Causality in Crisis? (Notre Dame: University Press, 1997).

Fortunately, these two developments are just beginning to merge:
the computer scientists Judea Pearl and Joseph Halpern⁴ have
developed theories of token causation in terms of structural equations of
the sort used in causal modeling. The account I shall present here
bears a very strong resemblance to the theories developed by Pearl
and Halpern. I shall not present their account explicitly and contrast
it with mine, but I urge philosophers to read this work on their own.
As a case study, I shall explore the problem of the transitivity of
causation from within the structural equations framework. Causation
is transitive if and only if, whenever a causes b and b causes c, then a
causes c as well. Many philosophers, notably Lewis, have claimed that
causation is transitive. Others⁵ have raised powerful objections to the
transitivity of causation. I side with the objectors. This is not to say
that I shall offer a conclusive refutation of the transitivity thesis: the
transitivity fetishist may well be able to preserve her cherished
principle by tweaking my account in various ways. But I shall show that
there is no independent motivation for accepting the transitivity thesis:
all of the benefits of maintaining transitivity can be had without
paying the costs.

By working within the structural equations framework, I shall not
be abandoning the traditional counterfactual framework. Rather, I
shall present structural equations as tools for representing patterns of
counterfactual dependence. These tools allow us to make explicit the
structural differences between those cases which appear to instantiate
the transitivity of causation, and those which appear to be
counterexamples. The structural equations framework draws attention to a
certain kind of counterfactual whose importance to the analysis of
causation has hitherto been ignored by philosophers. I shall use this
type of counterfactual to define the notion of an active route between
two events. If it is possible to give a reductive analysis of
counterfactuals in purely acausal terms, as Lewis⁶ believes, then I offer a
reductive analysis of active routes. In those cases which appear to be
counterexamples to the transitivity of causation, the putative cause
and effect fail to be connected by an active route.


4 Pearl, Causality (New York: Cambridge, 2000), chapter 10; Halpern and Pearl,
“Causes and Explanations: A Structural-model Approach,” Technical report R-266,
Cognitive Systems Laboratory, University of California/Los Angeles, 2000.

5 See especially Michael McDermott, “Redundant Causation,” British Journal for the
Philosophy of Science, XLVI (1995): 523-44.

6 See, for example, “Counterfactual Dependence and Time’s Arrow,” Noûs, XIII
(1979): 455-76, reprinted in Philosophical Papers, Volume II, pp. 32-52; and
Philosophical Papers, Volume II, introduction.

In focusing on the problem of transitivity, I shall have little or
nothing to say about a variety of other problems in the theory of
causation. I shall assume determinism, and hence leave unsolved
important problems involving indeterministic causation. I shall
discuss only briefly the problems of symmetric overdetermination, late
preemption, and “trumping.” My strategy will be to take Lewis’s
account of causation as a foil,⁷ argue that my account does a better
job on the issue of transitivity, and say just enough about these further
issues to suggest that my account fares no worse in these arenas.


I. TRANSITIVITY: ANATOMY OF A PUZZLE

Our point of departure will be Lewis’s original counterfactual theory
of causation. Let c and e be distinct events that both occurred. Then
e counterfactually depends upon c if and only if, if c had not occurred, e
would not have occurred. Lewis takes counterfactual dependence to
be sufficient but not necessary for causation: causation is defined as
the ancestral of counterfactual dependence, rendering causation
transitive by definition.
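
The ancestral of a relation is its transitive closure, which is why causation so defined is transitive by construction. As a minimal illustrative sketch (the function name and the three event labels are assumptions introduced here, not anything in Lewis's text), the closure of a finite dependence relation can be computed as follows. Each pair (x, y) is read as "x counterfactually depends upon y".

```python
def ancestral(pairs):
    """Return the transitive closure (the 'ancestral') of a finite relation."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for (x, y) in list(closure):
            for (y2, z) in list(closure):
                # Chain x -> y and y -> z into x -> z.
                if y == y2 and (x, z) not in closure:
                    closure.add((x, z))
                    changed = True
    return closure


# Hypothetical chain: e depends upon d, and d depends upon c,
# but e does not depend directly upon c.
depends = {("e", "d"), ("d", "c")}
causal = ancestral(depends)
```

Here `causal` contains the pair ("e", "c") even though `depends` does not: on the ancestral definition, c counts as a cause of e whether or not e depends upon c.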

Lewis⁸ offers a detailed account of what must be “held fixed” when
evaluating the relevant counterfactuals. For our purposes, it is
enough to point to two features that these counterfactuals must have.
First, the counterfactuals must not backtrack. If a caused c (but not vice
versa), then a counterfactual of the form ‘If c had not occurred,
then ...’ must hold a fixed. If not, the theory would incorrectly rule
that c causes a. Second, the counterfactuals must “foretrack”: if c
causes e, we do not want to hold e fixed when evaluating the
counterfactual ‘If c had not occurred, then ...’. If we do, the consequent of
the conditional will obviously not be ‘e would not have occurred’.

Note that these are restrictions on what is to be tacitly held fixed
when entertaining counterfactual antecedents; they are not
restrictions on the antecedents that we are permitted to entertain. The
necessity of foretracking from c to e does not prevent us from
entertaining a counterfactual of the form ‘If c had not occurred, but e had
occurred anyway, then ...’.⁹ Let us call a counterfactual of this sort an
explicitly nonforetracking or ENF counterfactual. In the sequel, I shall
argue that ENF counterfactuals should play a central role in the
analysis of causation.


7 I do not deny that other counterfactual theories of causation would make
worthy foils. Space limitations prohibit detailed comparison with all such rivals.

8 “Counterfactual Dependence and Time’s Arrow.”

9 Lewis himself makes this point in “Finkish Dispositions,” Philosophical Quarterly,
XLVII (1997): 143-58, here p. 150.

Why does Lewis define causation as the ancestral of counterfactual
dependence? There is certainly a strong pretheoretic intuition that
causation is transitive, but this can be explained without building
transitivity into the analysis. Let us call a case of causation ordinary if
it has this structure: e depends counterfactually upon d, which in turn
depends counterfactually upon c, and e also depends counterfactually
upon c. Most cases of causation are ordinary, and in such cases we can
explain why c counts as a cause of e just by identifying causation with
counterfactual dependence.

Trouble arises in “extraordinary” cases. In many such cases, we
judge that there is causation without counterfactual dependence.
Here is a standard example:


“Backup”: an assassin-in-training is on his first mission. Trainee is an
excellent shot: if he shoots his gun, the bullet will fell Victim. Supervisor
is also present, in case Trainee has a last-minute loss of nerve (a common
affliction among student assassins) and fails to pull the trigger. If
Trainee does not shoot, Supervisor will shoot Victim herself. In fact,
Trainee performs admirably, firing his gun and killing Victim.


In this case, it seems that Trainee’s shot caused the death of Victim,
even though Victim’s death does not counterfactually depend upon
Trainee’s shot. This is a case of preemption: by shooting at Victim,
Trainee preempted a process that would itself have resulted in
Victim’s death. The standard solution is to invoke the transitivity of
causation: Trainee’s shot is a cause of Victim’s death because there is
a chain of counterfactual dependence running from the former to
the latter. Consider a further event (call it ‘b’) that was not mentioned
explicitly in the foregoing description: the presence of a bullet en
route from Trainee to Victim. Had Trainee not shot, b would not have
occurred, and if b had not occurred, Victim would not have died.
Note the importance of the “no backtracking” rule in the second
counterfactual: if b had not occurred, Trainee would have shot
anyway, so Supervisor would not have shot.

Other examples suggest, however, that causation is not transitive in
general.


“Boulder”: a boulder is dislodged, and begins rolling ominously toward
Hiker. Before it reaches him, Hiker sees the boulder and ducks. The
boulder sails harmlessly over his head with nary a centimeter to spare.
Hiker survives his ordeal.¹⁰


10 This example is from an early draft of Hall’s “Two Concepts of Causation,”
forthcoming in Collins et alia.

“Dog Bite”: Terrorist, who is right-handed, must push a detonator button
at noon to set off a bomb. Shortly before noon, he is bitten by a dog on
his right hand. Unable to use his right hand, he pushes the detonator
with his left hand at noon. The bomb duly explodes.¹¹


In each case, we have a chain of counterfactual dependence. Hiker
would not have ducked if the boulder had not fallen, and given the
trajectory of the boulder, he would not have survived if he had not
ducked. The dog bite caused Terrorist to push the detonator with his
left hand, and his pushing the detonator with his left hand caused the
bomb to explode. But the fall of the boulder did not cause Hiker to
survive and the dog bite did not cause the explosion; these are the
verdicts of common sense.

We should be particularly troubled that we judge there to be a
causal relationship in that case where the chain of counterfactual
dependence is hardest to see. In “Backup,” we must find an
intermediate event that is not made salient in the presentation of the
example, namely, b, and imagine it away while holding Trainee’s shot
fixed. We are to imagine the bullet vanishing into thin air or some
such. This is no homey piece of counterfactual reasoning, but
requires the efforts of a trained philosopher. In “Boulder” and “Dog
Bite,” by contrast, it is easy to see the needed intermediate events:
Hiker’s ducking and Terrorist’s pushing the detonator with his left
hand (respectively). These two events were specified as parameters of
the examples. Moreover, the necessary counterfactual reasoning is
not particularly straining: it is easy to imagine, for example, what
would have happened if Hiker had not ducked, even holding fixed
the boulder’s fall. The defender of transitivity must maintain that we
have inconsistently acute intuitive powers: we see causal relations that
are underwritten by obscure chains of counterfactual dependence,
and yet we are blind to causal relations that are underwritten by
obvious chains of counterfactual dependence.

I shall argue that we can accept “Dog Bite” and “Boulder” as
counterexamples and provide an alternative account of “Backup.”
Trainee’s shot caused Victim’s death, not because there is a chain of
counterfactual dependence, but because there is an active route
between Trainee’s shot and Victim’s death. The existence of this route
is revealed by an ENF counterfactual: if Trainee had not shot, and
Supervisor still did not shoot, then Victim would not have died. There
are no comparable ENF counterfactuals in “Boulder” and “Dog Bite.”


11 From McDermott, “Redundant Causation.”

II. FURTHER PROBLEMS

While my primary focus will be on the problem of transitivity, I shall
briefly mention a few further problems with Lewis’s theory along with
his recent attempt¹² to solve some of them. The first sort of problem
involves cases of symmetric overdetermination, in which two events have
an equal claim to be causes of a third event, which does not depend
counterfactually upon either of them. Lewis¹³ claims that he has no
clear intuitions about such cases, and thus dismisses them as being of
no diagnostic value.

A second type of problem involves cases of preemption that have
different structures from “Backup.” Lewis¹⁴ uses the term early cutting
to describe this structure. In addition, he recognizes cases of
trumping¹⁵ and late cutting. Here is an example of the latter:


Billy and Suzy both throw rocks at a bottle. Suzy’s throw gets there first,
shattering the bottle. Billy’s throw arrives at the scene a split second
later, encountering nothing but air where the bottle used to be.¹⁶


Suzy’s throw is clearly a cause of the bottle’s shattering, Billy’s not.
The shattering of the bottle does not counterfactually depend upon
Suzy’s throw, and there is no chain of counterfactual dependence
from Suzy’s throw to the bottle’s shattering.¹⁷

Lewis responds to these problems by modifying his theory of
causation. He first defines the notion of influence:


Where c and e are distinct actual events, c influences e if and only if
there is a substantial range c₁, c₂, ... of different not-too-distant alterations
of c (including the actual alteration of c) and there is a range e₁, e₂, ... of
alterations of e, at least some of which differ, such that if c₁ had occurred,
e₁ would have occurred, and if c₂ had occurred, e₂ would have occurred,
and so on.¹⁸


(An alteration c′ of c is a fine-grained event that is similar to c, but
possibly different in matters of detail.) Suzy’s throw influences the
shattering of the bottle: had Suzy thrown slightly earlier, or aimed at
a slightly different point, the bottle would have shattered slightly


12 “Causation as Influence,” this JOURNAL, XCVII, 4 (April 2000): 182-97.

13 Both in “Causation” and in “Causation as Influence.”

14 “Causation as Influence.”

15 For a detailed discussion of trumping, see Schaffer, “Trumping Preemption,”
this JOURNAL, XCVII, 4 (April 2000): 165-81.

16 This example appears in “Causation as Influence,” but has been in the lore for
some time.

17 For a defense of the latter claim, see “Postscript E” to “Causation” in Lewis’s
Philosophical Papers, Volume II, pp. 193-218.

18 “Causation as Influence,” p. 190.

earlier, or in a slightly different way. Lewis then defines causation as
the ancestral of influence, citing the problem of early cutting
preemption as his motivation for doing so.¹⁹

I conclude this section by pointing to an interesting feature of
Lewis’s new account: it is infected with context sensitivity. In order for
c to influence e, there must be true counterfactuals involving a
substantial range of not-too-distant alterations of c. It may be an objective
matter whether a set of counterfactuals of the form ‘If cᵢ had
occurred, eᵢ would have occurred’ are true, but there is a further
question about whether we should describe this pattern by saying ‘c
causes e’. The answer to this question will depend, in part, upon
which unactualized possibilities we consider “too distant” to take
seriously. This point is underscored in Lewis’s²⁰ brief discussion of
preemptive prevention. The account developed below will agree with
Lewis’s new account on this point.


III. EQUATIONS AND GRAPHS FOR DUMMIES

The use of directed graphs to represent systems of causal
relationships dates back at least to the work of Sewall Wright²¹ in the early
1920s; the use of structural equations, pioneered by R. Frisch,²² T.
Haavelmo,²³ and others in the 1930s and 1940s, is not much younger.
Pearl’s Causality is representative of the current state of the art.²⁴

A system of structural equations is a sequence of equations ℰ
relating the values of variables belonging to some set 𝒱. An ordered
pair ⟨𝒱, ℰ⟩ will be called a causal model. In the simplest case, a variable


19 See especially “Causation as Influence,” p. 194. Note that in Lewis’s new
account, transitivity would no longer be needed to handle “Backup”: the precise
time and manner of Victim’s death does indeed depend upon the precise time and
manner of Trainee’s shot. Lewis maintains that transitivity will still be needed to
handle fancier examples of early cutting. I shall continue to use “Backup” in
unmodified form; this will not change the fundamental structure of our central
problem.

20 “Causation as Influence,” p. 197. For more detailed discussion, see Collins,
“Preemptive Prevention,” this JOURNAL, XCVII, 4 (April 2000): 223-34.

21 “Correlation and Causation,” Journal of Agricultural Research, XX (1921): 557-85.

22 “Statistical versus Theoretical Relations in Economic Macrodynamics,” League
of Nations Memorandum, 1938; reprinted as “Autonomy of Economic Relations” in
D. Hendry and M. Morgan, eds., The Foundations of Econometric Analysis (New York:
Cambridge, 1995), pp. 407-19.

23 “The Statistical Implications of a System of Simultaneous Equations,”
Econometrica, XI (1943): 1-12, reprinted in Hendry and Morgan, pp. 454-63; “The Probability
Approach in Econometrics,” Supplement to Econometrica, XII, reprinted, in part, in
Hendry and Morgan, pp. 440-53, 477-90.

24 Philosophers will probably be most familiar with the work of Peter Spirtes,
Clark Glymour, and Richard Scheines, especially their Causation, Prediction, and
Search (New York: Springer, 1993). This work is very much in the same vein,
although I shall not be using their probabilistic framework here.

will have only two values, and will represent the occurrence or
nonoccurrence of a certain event. For example, in representing the
causal relations in “Backup,” we shall use a variable T, which takes the
value 1 when Trainee shoots, and takes the value 0 when he does not.
As a general convention, variables will be represented by italicized
capital letters. When a binary variable E is used to represent the
occurrence or nonoccurrence of an event e, E = 1 will represent the
occurrence of e, and E = 0 the nonoccurrence of e. But variables need
not be binary. For example, the values of a variable could represent
the mass of some object, or they could represent various alterations of
a particular event. 𝒱 contains both exogenous and endogenous variables.

Each equation in ℰ expresses the value of one variable, appearing
on the left-hand side of the equation. Each variable appears on the
left-hand side of exactly one such equation. ℰ is comprised of two
subsets, ℰ_x and ℰ_n. The former contains equations with exogenous
variables on the left-hand side, while the latter contains equations
with endogenous variables on the left-hand side. Equations in ℰ_x all
take the simple form X = x: they simply state the actual value of the
variable in question. Equations in ℰ_n express the value of the
endogenous variable as a function of the values of other variables in the
set 𝒱:


(*) Z = f_Z(X, Y, ..., W)


The syntax of such an equation is richer than that of ordinary
mathematical equations. In particular, structural equations are not
symmetric: (*) is not equivalent to f_Z(X, Y, ..., W) = Z. (In structural
equations, side matters.) This is because structural equations encode
counterfactuals. For example, (*) encodes a set of counterfactuals of
the following form:


If it were the case that X = x, Y = y, ..., W = w, then it would be the case
that Z = f_Z(x, y, ..., w).


These counterfactuals are to be understood along the lines discussed
in section I above; in particular, they do not backtrack.

Equations in ℰ_n must always be written in minimal form: if for all x,
x′, y, ..., w, f_Z(x, y, ..., w) = f_Z(x′, y, ..., w), then the value of Z does not
depend upon the value of X at all, and the structural equation for Z
must be rewritten Z = f_Z(Y, ..., W). As written, then, the equation (*)
says that the value of the variable Z depends counterfactually upon
the values of the variables X, Y, ..., W. By the same token, equations in
ℰ_n must always include as arguments any variables in 𝒱 upon which
Z counterfactually depends, given the values of the other variables. If,
for some x, x′, y, ..., w, f_Z(x, y, ..., w) ≠ f_Z(x′, y, ..., w), then the value
of Z does depend upon the value of X, and Z = f_Z(Y, ..., W) is not in
ℰ_n. The correct equation for Z can be arrived at by expressing the
value of Z as a function of all other variables in 𝒱 and then
eliminating those variables whose values are redundant given every
assignment of values to the other variables.
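
The elimination procedure just described can be sketched in code. The following is only an illustration under stated assumptions: variables are taken to be binary unless another value set is supplied, and the helper names `makes_difference` and `relevant_arguments` are hypothetical, introduced here for the example. An argument is kept exactly when changing its value alters the function's output for at least one assignment of values to the other arguments.

```python
from itertools import product


def makes_difference(f, i, n, values):
    """True if changing argument i alters f for some assignment to the others."""
    for assignment in product(values, repeat=n):
        for alt in values:
            changed = assignment[:i] + (alt,) + assignment[i + 1:]
            if f(*assignment) != f(*changed):
                return True
    return False


def relevant_arguments(f, names, values=(0, 1)):
    """Return the argument names that must appear in the minimal form of f."""
    return [
        name
        for i, name in enumerate(names)
        if makes_difference(f, i, len(names), values)
    ]


# Hypothetical candidate equation Z = f(X, Y, W) in which X is redundant.
def candidate(x, y, w):
    return max(y, w)


minimal_parents = relevant_arguments(candidate, ["X", "Y", "W"])
```

For this candidate function, `minimal_parents` is `["Y", "W"]`, so the equation would be rewritten with Y and W as its only arguments.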

It will sometimes be helpful to use symbols familiar from sentential
logic to represent relations between variables. The symbols ¬, ∨, and
∧ will represent the following mathematical functions: ¬X = 1 − X,
X ∨ Y = max{X, Y}, X ∧ Y = min{X, Y}. When the variables are binary,
these functions behave much as the corresponding connectives do in
sentential logic. For example, if Z = X ∨ Y, then Z will take the value
1 if and only if either X or Y takes the value 1.
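
A short sketch makes the correspondence concrete. The function names `neg`, `disj`, and `conj` are labels assumed for this illustration; the definitions are just the three arithmetic functions stated above.

```python
def neg(x):
    """Negation as an arithmetic function: 1 - X."""
    return 1 - x


def disj(x, y):
    """Disjunction as an arithmetic function: max{X, Y}."""
    return max(x, y)


def conj(x, y):
    """Conjunction as an arithmetic function: min{X, Y}."""
    return min(x, y)


# Rows of (X, Y, X or Y, X and Y) for binary values.
truth_table = [(x, y, disj(x, y), conj(x, y)) for x in (0, 1) for y in (0, 1)]
```

On binary inputs the table matches the usual truth tables for 'or' and 'and'; on nonbinary inputs the same functions remain defined, which is why they are introduced arithmetically.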

If the variable X figures as an argument on the right-hand side of
the structural equation for the endogenous variable Z, then X is a
parent of Z. Exogenous variables have no parents within a model,
while endogenous variables do.

A system of structural equations can be given an elegant graphical
representation. The variables in 𝒱 form the nodes of a graph. The
nodes are connected by directed edges or “arrows” according to the
following rule: an arrow is drawn from X to Z if and only if X is a
parent of Z. A directed path from variable X to variable Z is a sequence
of arrows lined up “tip-to-tail” connecting X with Z. A variable is
exogenous if there is no arrow directed into it.
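
As a rough sketch of these graphical notions, a graph can be stored as a map from each variable to its parents. The parent map below is the one implied by the equations for "Backup" in this section; the function names are assumptions made for the illustration.

```python
def arrows(parents):
    """One arrow (X, Z) for every parent X of every variable Z."""
    return {(p, child) for child, ps in parents.items() for p in ps}


def has_directed_path(parents, start, end):
    """True if arrows lined up tip-to-tail lead from start to end."""
    children = {}
    for p, child in arrows(parents):
        children.setdefault(p, []).append(child)
    stack, seen = list(children.get(start, [])), set()
    while stack:
        node = stack.pop()
        if node == end:
            return True
        if node not in seen:
            seen.add(node)
            stack.extend(children.get(node, []))
    return False


def exogenous(parents):
    """Variables with no arrow directed into them."""
    return {var for var, ps in parents.items() if not ps}


# Parent structure of "Backup": T has no parents, S and B have parent T,
# and V has parents B and S.
backup_parents = {"T": [], "S": ["T"], "B": ["T"], "V": ["B", "S"]}
```

With this map there is a directed path from T to V but none from V to T, and T is the only exogenous variable.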

If the structural equations can be ordered so that no variable
appears on the left-hand side after having appeared on the right-hand
side, then the system of equations is acyclic. Equivalently, a
system of structural equations is acyclic if no directed path in the
corresponding graph runs from a given node back into itself.
Intuitively, an acyclic system of equations represents a causal structure in
which there are no causal loops. All of the systems considered here
will be acyclic. If a system of equations is acyclic, then the system of
equations has a unique solution. That is, the values of the exogenous
variables together with the other structural equations entail a unique
value for every variable.
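
The reason acyclicity guarantees a unique solution can be seen in a small sketch: order the variables so that parents come before children, then compute each value once. The code below is an illustration only. It assumes Python 3.9 or later for the standard-library `graphlib` module, and the encoding of equations as (parents, function) pairs, the variable names, and the toy systems are all assumptions made for the example.

```python
from graphlib import CycleError, TopologicalSorter


def solve(equations):
    """Solve an acyclic system; equations maps variable -> (parents, function)."""
    graph = {var: set(parents) for var, (parents, _) in equations.items()}
    values = {}
    # static_order() yields every variable after all of its parents,
    # and raises CycleError if the system contains a loop.
    for var in TopologicalSorter(graph).static_order():
        parents, func = equations[var]
        values[var] = func(*(values[p] for p in parents))
    return values


def is_acyclic(equations):
    """True if the equations can be ordered with parents before children."""
    try:
        solve(equations)
    except CycleError:
        return False
    return True


# Hypothetical acyclic system: X = 1, Y = 1 - X, Z = max{X, Y}.
toy = {
    "X": ((), lambda: 1),
    "Y": (("X",), lambda x: 1 - x),
    "Z": (("X", "Y"), lambda x, y: max(x, y)),
}
solution = solve(toy)

# Hypothetical cyclic system: X = Y and Y = X.
loop = {"X": (("Y",), lambda y: y), "Y": (("X",), lambda x: x)}
```

Because each variable is computed exactly once from values already fixed, no second solution is possible for `toy`; for `loop`, no such ordering exists and the sorter reports a cycle.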

I shall illustrate structural equations and graphical representations
using “Backup.” The causal graph is depicted in figure 1. The
variables are to be interpreted as follows: T = 1 corresponds to Trainee’s
shooting, T = 0 to his refraining; S = 0 or 1 depending upon whether
Supervisor shoots; B = 0 or 1 depending upon the presence of a
bullet in flight at some point along the line from Trainee to Victim;
and V = 0 or 1 according to whether Victim dies. The set of structural
equations is:

ℰ: T = 1, S = ¬T, B = T, V = B ∨ S


T is an exogenous variable. The equation V = B ∨ S encodes the
following counterfactuals: if either B or S were to take the value 1 (if
either b or Supervisor’s shot were to occur), then Victim would die; if
B and S were both to take the value 0, then Victim would not die.
ℰ has the following unique solution:


T = 1, S = 0, B = 1, V = 1


That is, Trainee actually shot, Supervisor did not shoot, b occurred,
and Victim died.

It has become common in the philosophical literature to represent
causal relationships in terms of neuron diagrams. A circle is used to
represent an event which may occur (“fire”) or not. A shaded circle
represents an event that did occur, while a hollow circle represents an
event that did not occur. Neurons may be connected in one of two
ways: by a stimulatory connection, represented by an arrow, or by an
inhibitory connection, represented by an arrow with a backward
head. A stimulatory connection between two events indicates that, if
the first occurs, it will cause the second to occur; an inhibitory
connection indicates that, if the first occurs, it will prevent the second
from occurring (inhibitory connections override stimulatory ones).
Causal graphs work similarly, but with several important differences.
First, each vertex represents a variable. A variable may be binary,
taking values of 1 or 0 according to whether some event occurs or not,
but it may also be multivalued. Second, the causal graph does not
itself specify the actual value of the variable; that information is
contained in the accompanying set of structural equations. Third, an
arrow from one variable to another says nothing about the kind of
connection that exists between them: once one allows nonbinary
variables, the possible connections cannot be classified neatly into
stimulatory and inhibitory.²⁵ I have argued elsewhere²⁶ that attempts
to shoe-horn all causal relationships into these two types lead to a
number of pseudo-problems in the theory of causation. Instead, the
nature of the connection between variables is represented in the
corresponding system of structural equations.

Each equation in ℰ encodes counterfactual information. Note,
however, that the equations in ℰ do not directly represent all
counterfactuals that are true of the system. Rather, ℰ is a set of
fundamental equations from which all other counterfactuals may be
derived. In general, in order to evaluate the counterfactual ‘If it
were the case that X = x, Y = y, ..., W = w, then ...’, we replace the
equation for each of these variables with the identity stipulated; for
example, we replace the equation for X with X = x. The equations for
the other variables remain unchanged. In effect, this creates a new
set of structural equations in which X, Y, ..., W are exogenous
variables. Graphically, the arrows directed into these variables are
removed, while all other arrows remain intact. Instead of these
variables having their values causally determined in the normal way,
they are “miraculously” set to the new hypothetical values. The values
of the remaining variables can then be computed. The result may be
thought of as characterizing the “closest possible world(s)” where
X = x, Y = y, ..., W = w are all true. A variable Z depends
counterfactually upon a variable X in a system of structural equations
if and only if, in the actual solution, X = x and Z = z, and there
exist x′ ≠ x and z′ ≠ z such that the result of replacing the equation
for X with X = x′ yields Z = z′. This says that there is some possible
value of X such that if X had taken that value, then the value of Z
would have been different.
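
The evaluation procedure just described can be illustrated with a
short sketch. It is only an illustration under stated assumptions: the
three-variable system below is a hypothetical toy example, and the
helper names solve, intervene, and depends_counterfactually are
introduced here for convenience. Variables are assumed to be listed
parents-first.

```python
def solve(equations, order):
    """Compute every variable's value; `order` lists variables parents-first."""
    values = {}
    for var in order:
        values[var] = equations[var](values)
    return values


def intervene(equations, settings):
    """Replace the equation of each variable in `settings` with a constant.

    This corresponds to removing the arrows into those variables: they
    become exogenous, and every other equation is left unchanged.
    """
    modified = dict(equations)
    for var, value in settings.items():
        modified[var] = lambda values, value=value: value
    return modified


def depends_counterfactually(equations, order, domains, x, z):
    """True if some non-actual value of x would yield a different value of z."""
    actual = solve(equations, order)
    return any(
        solve(intervene(equations, {x: alt}), order)[z] != actual[z]
        for alt in domains[x]
        if alt != actual[x]
    )


# Hypothetical toy system (an assumption for illustration): X = 1, Y = X, Z = Y.
equations = {
    "X": lambda v: 1,
    "Y": lambda v: v["X"],
    "Z": lambda v: v["Y"],
}
order = ["X", "Y", "Z"]
domains = {"X": (0, 1), "Y": (0, 1), "Z": (0, 1)}

actual = solve(equations, order)
# Setting Y = 0 leaves X at its actual value; only downstream values change.
world = solve(intervene(equations, {"Y": 0}), order)
```

Representing each equation as a function of the already computed
values keeps the intervention step trivial: an intervention is just
the substitution of a constant function.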

The standard treatment of “Backup” requires the counterfactual
‘If the bullet had not been in flight from Trainee to Victim, then
Victim would not have died’. Modifying BA by setting B = 0, we get
the following modified set of structural equations:


BA′    T = 1, S = ¬T, B = 0, V = B ∨ S


25 Indeed, even with binary variables, this problem can arise as soon
as you have two “neurons,” each connected to a third. Suppose that the
third neuron fires if exactly one of the others does; then the
connections are neither purely stimulatory nor purely inhibitory.

26 “A Generalized Probabilistic Theory of Causal Relevance,” Synthese,
xcvii (1993): 335-64; “The Mishap at Reichenbach Fall: Singular vs.
General Causation,” Philosophical Studies, lxxviii (1995): 257-91;
“Farewell to Binary Causation,” Canadian Journal of Philosophy, xxvi
(1996): 267-82; “The Role of Contrast in Causal and Explanatory
Claims,” Synthese, cvii (1996): 395-419.

In the graphical representation of BA′, the arrow from T to B is
removed from figure 1. Solving, we have:


T = 1, S = 0, B = 0, V = 0


So if the bullet had not been in flight, Trainee still would have
shot, Supervisor would not have fired, and Victim would not have died,
just as required by the standard solution. This example illustrates
how counterfactuals do not backtrack within the structural equations
approach.
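
The calculation for “Backup” can be checked mechanically. The sketch
below is illustrative only. It assumes that the unmodified equation
for B is B = T, which is inferred from the remark that the
intervention removes the arrow from T to B; the function and variable
names are hypothetical.

```python
def solve(equations, order):
    """Compute every variable's value; `order` lists variables parents-first."""
    values = {}
    for var in order:
        values[var] = equations[var](values)
    return values


# "Backup" with the bullet variable B. The equation B = T is an assumption
# reconstructed from the text; the other equations follow the modified set.
backup = {
    "T": lambda v: 1,                 # Trainee shoots
    "S": lambda v: 1 - v["T"],        # Supervisor shoots only if Trainee does not
    "B": lambda v: v["T"],            # bullet in flight from Trainee (assumed)
    "V": lambda v: v["B"] | v["S"],   # Victim dies if either bullet arrives
}
order = ["T", "S", "B", "V"]

actual = solve(backup, order)

# Intervene on B: replace its equation with the constant 0.
modified = dict(backup)
modified["B"] = lambda v: 0
no_bullet = solve(modified, order)
```

Because only the equation for B is replaced, T keeps its value of 1
and S stays at 0; the change propagates forward to V but never
backward to T.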

A system of structural equations is an elegant means for representing
a whole family of counterfactuals of just the sort that Lewis’s
counterfactual theory of causation depends upon. The correctness of
a set of structural equations, and of the corresponding graph, depends
upon the truth of these counterfactuals. If, as Lewis believes,
the truth values of counterfactuals supervene upon noncausal facts,
then the correctness of a set of structural equations does as well.


IV. CAUSAL ROUTES
Consider the causal structure depicted in figure 2. What does the
arrow from X to Z mean in such a diagram? This is a fundamental
question, whose answer serves to highlight an important difference
between the structural equations approach to causation and the
counterfactual approach more familiar to philosophers.

Note first the arrows from X to Y and from Y to Z. These indicate
that the value of Z depends counterfactually upon the value of Y,
which in turn depends upon the value of X. So far, so good. When
these relationships hold, the traditional counterfactual approach to
causation leads us to ask two further questions. Does the value of Z
depend counterfactually upon the value of X? If not, is the value of
X nonetheless a cause of the value of Z? But the arrow from X to Z in
figure 2 does not correspond to an affirmative answer to either of
these questions. Rather, the arrow from X to Z means that the
structural equation for Z is of the form Z = f_Z(X, Y), where X is an
essential argument: there exist x, x′, and y such that
f_Z(x, y) ≠ f_Z(x′, y). An arrow from X to Z thus means that the value
of Z can depend counterfactually upon the value of X, even holding
fixed the value of Y. The natural causal interpretation of this
counterfactual is that the value of X can have an effect on the value
of Z over and above the effect it has in virtue of causing the value
of Y. There are two routes whereby X influences Z: one which runs
through Y, and one direct route which bypasses Y. The overall effect
of X on Z will depend upon both of these routes.

[Figure 2: arrows from X to Y, from Y to Z, and from X to Z]

Pay attention to the counterfactual that reveals the direct route
from X to Z in figure 2: it requires that we hypothetically change the
value of X, while holding the value of Y fixed, and evaluate whether
Z changes in response. This is an ENF counterfactual: it holds fixed
Y, an effect of X, in supposing X to take on a different value; it
does not allow the counterfactual to foretrack from X to Y.

“Traditional” counterfactual approaches to causation (those in
the tradition of Lewis) do not employ ENF counterfactuals. They
employ only counterfactuals with simple antecedents, antecedents
that make stipulations about only one event. As a result, they are
unable to detect the direct route from X to Z in figure 2. Indeed,
despite the extensive use of neuron diagrams by philosophers working
within the counterfactual tradition, neither I nor anyone I have
polled has seen a diagram in which three neurons are configured in
the triangular pattern shown in figure 2!

Whether a route is direct or not is relative to the variable set 𝒱.
It may be, for instance, that relative to the variable set 𝒱 this
route is direct, while in the richer variable set 𝒱′, the route from
X to Z that bypasses Y is mediated by some further variable W in
𝒱′ \ 𝒱. If it is possible to interpolate a variable W along the direct
route from X to Z, the traditional counterfactual approach is able to
detect the existence of two routes from X to Z: one via Y and the
other via W. This ability is attested to by the abundance of such
“diamond-shaped” configurations in neuron diagrams. Since the
traditional counterfactual approach is able to distinguish causal
routes only by interpolating variables, it is hardly surprising that
the standard solution to cases of preemption like “Backup” is to
interpolate and find a chain of counterfactual dependence. But even
when interpolation is possible and ENF counterfactuals are not needed
to detect the existence of distinct causal routes, ENF counterfactuals
provide crucial information about the nature of the distinct causal
routes, and this is the key to solving our problem.

Returning to figure 2, note that the arrow from X to Z implies only
that there is some value y of Y such that f_Z(x, y) ≠ f_Z(x′, y), that
is, such that Z depends counterfactually upon X while fixing Y at y.
It may be, however, that given the actual value y of Y, Z does not
depend upon X. When Z does depend upon X given the actual value of Y,
we shall say that the direct route from X to Z is active. Suppose, for
example, that all three variables are binary and that Z = X ∧ Y. Then,
if Y takes the value 1, the direct route from X to Z will be active:
whether Z = 0 or 1 will depend upon whether X = 0 or 1 (while holding
Y = 1 fixed). In such a case, the value of X does in fact play a role
in determining the value of Z, over and above the role it plays by
determining Y. On the other hand, if Y = 0, then the direct route from
X to Z is inactive.
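
The binary example Z = X ∧ Y can be checked directly. The following
sketch is illustrative; the function names are hypothetical and the
code simply enumerates the values of X while holding Y fixed.

```python
def f_Z(x, y):
    """Structural equation Z = X AND Y for binary variables."""
    return x & y


def is_essential_argument(f, x_domain, y_domain):
    """True if, for some fixed y, changing x changes the value of f."""
    return any(
        f(x1, y) != f(x2, y)
        for y in y_domain
        for x1 in x_domain
        for x2 in x_domain
    )


def direct_route_active(f, x_actual, y_actual, x_domain):
    """True if Z depends on X while Y is held fixed at its actual value."""
    z_actual = f(x_actual, y_actual)
    return any(
        f(alt, y_actual) != z_actual
        for alt in x_domain
        if alt != x_actual
    )


binary = (0, 1)
arrow_present = is_essential_argument(f_Z, binary, binary)
active_when_y_is_1 = direct_route_active(f_Z, 1, 1, binary)
active_when_y_is_0 = direct_route_active(f_Z, 1, 0, binary)
```

The first check corresponds to the presence of the arrow from X to Z,
which requires dependence for some value of Y; the other two concern
dependence at a particular actual value of Y.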

Let us formulate this distinction between active and inactive routes
more precisely, at the same time generalizing to include cases in
which there are more than two routes between two variables, and
where the route being evaluated is not direct. Let ℰ be a system of
structural equations on the variables in set 𝒱. A route between two
variables X and Z in 𝒱 is an ordered sequence of variables
⟨X, Y_1, ..., Y_n, Z⟩ such that each variable in the sequence is in 𝒱
and is a parent of its successor in the sequence. Graphically, a route
between X and Z is a directed path from X to Z. A variable Y, distinct
from both X and Z, is intermediate between X and Z if and only if it
belongs to some route between X and Z. Then:


“Act”: The route ⟨X, Y_1, ..., Y_n, Z⟩ is active in the causal model
⟨𝒱, ℰ⟩ if and only if Z depends counterfactually upon X within the new
system of equations ℰ′ constructed from ℰ as follows: for all Y ∈ 𝒱,
if Y is intermediate between X and Z, but does not belong to the route
⟨X, Y_1, ..., Y_n, Z⟩, then replace the equation for Y with a new
equation that sets Y equal to its actual value in ℰ. (If there are no
intermediate variables that do not belong to this route, then ℰ′ is
just ℰ.)


The activity of a route is entailed by the truth of a certain kind of
ENF counterfactual. That is, the route is active if there is a true
counterfactual of the form: if the value of X had been x′, and the
value of variables that lie along other routes from X to Z were held
fixed, then the value of Z would have been different. By holding fixed
intermediates along other routes, any influence of X on Z along those
other routes is eliminated: hypothetical changes in X are not allowed
to foretrack along those routes. The relevant counterfactual thus
isolates the influence of X on Z along the route in question.
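
The definition “Act” lends itself to a direct implementation. The
sketch below is one possible rendering, offered under assumptions:
the model format (each variable mapped to its parents and a function
of their values), the helper names, and the three-variable example
(X = 1, Y = X, Z = X xor Y) are all hypothetical and chosen only to
exercise the definition.

```python
def solve(model, order, fixed=None):
    """Solve the system, overriding the equations of variables in `fixed`."""
    fixed = fixed or {}
    values = {}
    for var in order:
        if var in fixed:
            values[var] = fixed[var]
        else:
            parents, fn = model[var]
            values[var] = fn(*(values[p] for p in parents))
    return values


def routes(model, x, z):
    """All directed paths from x to z, each as a tuple of variable names."""
    if x == z:
        return [(z,)]
    children = [v for v, (parents, _) in model.items() if x in parents]
    return [(x,) + rest for child in children for rest in routes(model, child, z)]


def is_active(model, order, domains, route):
    """Apply "Act": freeze off-route intermediates at their actual values,
    then test whether the last variable depends on the first."""
    x, z = route[0], route[-1]
    actual = solve(model, order)
    intermediates = {v for r in routes(model, x, z) for v in r[1:-1]}
    frozen = {v: actual[v] for v in intermediates if v not in route}
    return any(
        solve(model, order, {**frozen, x: alt})[z] != actual[z]
        for alt in domains[x]
        if alt != actual[x]
    )


# Hypothetical triangular model: X = 1, Y = X, Z = X xor Y.
model = {
    "X": ((), lambda: 1),
    "Y": (("X",), lambda x: x),
    "Z": (("X", "Y"), lambda x, y: x ^ y),
}
order = ["X", "Y", "Z"]
domains = {"X": (0, 1), "Y": (0, 1), "Z": (0, 1)}

all_routes = routes(model, "X", "Z")
direct_active = is_active(model, order, domains, ("X", "Z"))
indirect_active = is_active(model, order, domains, ("X", "Y", "Z"))
```

In this toy model Z does not depend on X when nothing is held fixed,
yet the direct route comes out active once Y is frozen at its actual
value, which is the kind of isolation the definition is meant to
achieve.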

In analogy with Lewis’s definition of influence, we could emend
“Act” to require that there be “substantial” counterfactual
dependence of Z on X in the new system of equations. This would allow
us to deny that a route is active, for example, if X is multivalued
and very few values of X lead to a different value of Z, or if Z is
multivalued and changes in the value of X lead only to minimal changes
in the value of Z. Such an emendation would sacrifice precision, but
better fit the coarse grain of our ordinary causal judgments. I shall
leave this choice to the reader; the emendation will not be necessary
for any of our central examples.

My central proposal is:


Let c and e be distinct occurrent events, and let X and Z be variables
such that the values of X and Z represent alterations of c and e
respectively. Then c is a cause of e if and only if there is an active
causal route from X to Z in an appropriate causal model ⟨𝒱, ℰ⟩.²⁷


What makes a causal model ⟨𝒱, ℰ⟩ appropriate? There are at least
three requirements. The first two are objective: the equations in ℰ
must entail no false counterfactuals, and they must not represent
counterfactual dependence relations between events that are not
distinct. The third component is pragmatic: 𝒱 should not contain
variables whose values correspond to possibilities that we consider to
be too remote. I shall discuss these issues at greater length in
sections VII and VIII below. For now, note that these restrictions are
present in Lewis’s recent account and hence are not peculiar to the
present account.

In sections V, VII, and VIII below, I shall return to our three
central examples. I shall show that in “Backup,” there is a causally
active route from Trainee’s shot to Victim’s death. This gives us the
result that Trainee’s shot causes Victim’s death without the need to
invoke transitivity. This opens the door to the possibility of
accepting “Boulder” and “Dog Bite” as counterexamples to the
transitivity of causation, which I shall do. I shall show that there
is no active causal route from the dog bite to the explosion, or from
the boulder’s fall to Hiker’s survival.


V. “BACKUP” REVISITED
Let us represent the causal structure of “Backup,” this time omitting
the esoteric variable B. I claim as a virtue for my account that
consideration of this variable is unnecessary (albeit harmless). The
causal graph is depicted in figure 3, and the set of structural
equations is:


BA″    T = 1, S = ¬T, V = T ∨ S


The interpretation of the variables is the same as that given in
section III above. Figure 3 clearly shows two distinct routes from T
to V, and it follows from the equations BA″ that these two routes
“cancel”: V does not depend counterfactually upon T.

27 In “A Tale of Two Effects” (forthcoming), I take a slightly
different line. Sometimes, our judgments of causation attach to
individual active routes; at other times, we focus on the net effect
of one variable on another along all routes taken together (and indeed
there may even be cases where we attend to the effect along some
intermediate number of routes). Nonetheless, it is the notion of
dependence along a route captured by “Act” that seems to be at work in
our judgments of token causation in the cases at issue here.


[Figure 3: arrows from T to S, from S to V, and from T to V]


In order to see that the route ⟨T, V⟩ is active, we must show that V
depends counterfactually upon T while holding the value of S fixed.
In fact, T = 1, S = 0, and V = 1. So we must show that if T had been
0, and S had also been 0, then V would have been 0. We must solve
for the new structural equations:


BA‴    T = 0, S = 0, V = T ∨ S


It is easy to see that within this new system of equations, V must
equal 0.
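
The contrast between the two counterfactuals can be made concrete.
The sketch below is illustrative: it encodes the equations T = 1,
S = ¬T, V = T ∨ S, with optional overrides for T and S; the function
name and its parameters are hypothetical.

```python
def solve_backup(t=None, s=None):
    """Solve T = 1, S = not T, V = T or S, overriding T and/or S when given."""
    T = 1 if t is None else t
    S = (1 - T) if s is None else s
    V = T | S
    return {"T": T, "S": S, "V": V}


actual = solve_backup()

# Simple antecedent: only T is changed, so S is free to respond.
trainee_holds_fire = solve_backup(t=0)

# ENF antecedent: T is changed while S is held at its actual value.
supervisor_held_fixed = solve_backup(t=0, s=actual["S"])
```

With S free to respond, Supervisor shoots and V remains 1; with S held
at 0, V drops to 0, which is the dependence along the route ⟨T, V⟩.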

We have just demonstrated the truth of the following ENF
counterfactual: if Trainee had not shot, and Supervisor had not shot
either, then Victim would not have died. The truth of this
counterfactual is straightforward, and could have been evaluated
intuitively without the use of formal apparatus. It is in virtue of
the truth of this counterfactual that Trainee’s shot is a cause of
Victim’s death. No wonder, then, that we have such an easy time
judging that Trainee’s shot is a cause of Victim’s death; this
treatment of “Backup” is much simpler than that required by the
standard solution. And for those²⁸ who worry about such other-worldly
possibilities, the present account (unlike Lewis’s) would work equally
well if the assassins’ guns did not kill by firing bullets, but rather
by some sort of unmediated action at a distance.

28 For example, McDermott (“Redundant Causation”) and Schaffer
(“Trumping Preemption”) discuss cases involving magic spells that act
at a spatiotemporal distance.

Note that in order to know that S must be held fixed, we do not
need to know that Trainee’s shot caused Supervisor not to shoot. We
only need to know that the pattern of counterfactual dependence is
as described by BA″. Thus the need to hold fixed an event that is
intermediate between Trainee’s shot and Victim’s death does not
undermine the reduction of causation to patterns of counterfactual
dependence.


VI. FURTHER PROBLEMS REVISITED

Before examining the two counterexamples to transitivity, I wish to
return briefly to the problems discussed in section II. It should be
clear that Lewis’s solutions to the problems of late cutting and
trumping preemption are readily adapted to the present framework. We
could let the variable S take on different values representing
different alterations of Suzy’s throw; likewise, we could let values
of the variable B represent differences in the time and manner of the
bottle’s shattering. Then the value of B will depend counterfactually
upon the value of S, and the route from S to B will be active.

If the values of two variables X and Y symmetrically overdetermine
the value of Z (for example, if all the variables are binary and
Z = X ∨ Y), then “Act” does not indicate an active route from either
X or Y to Z. To the extent that this is a shortcoming of “Act,” it is
a shortcoming of Lewis’s theory as well. Thus, these three further
problems do not accord Lewis’s theory any advantage over that
presented here.

It is worth briefly noting that the structural equations approach
suggests novel solutions to the problems of late cutting preemption
and symmetric overdetermination. Detailed accounts are provided by
Halpern and Pearl.²⁹ Their formulations are slightly different from
mine, but their proposals are readily adapted to the account sketched
above. Consider first the case of late cutting preemption. Even
without considering fine differences in the way Suzy throws her rock,
or in the way the bottle shatters, it can be shown that there is an
active route from Suzy’s throw to the bottle’s shattering. This is
revealed by the following ENF counterfactual: given that Billy’s rock
did not hit the bottle, if Suzy had not thrown, the bottle would have
remained intact throughout the incident.

29 In Pearl, Causality, and Halpern and Pearl, “Causes and
Explanations.”

[Figure 4: arrows from D to P and from P to E]

The problem of symmetric overdetermination can be handled by
weakening “Act”. For reasons of space, I will provide only a compact
presentation of the technical details. Let ⟨X, Y_1, ..., Y_n, Z⟩ be a
route between the variables X and Z. Suppose that the actual values of
the variables on the route (other than X) are Y_1 = y_1, ...,
Y_n = y_n, Z = z. Let {W_1, ..., W_m} be a set of variables in 𝒱 that
do not belong to the route ⟨X, Y_1, ..., Y_n, Z⟩; these variables need
not be intermediate between X and Z. Let w_1, ..., w_m be possible
values of the variables W_1, ..., W_m, respectively. The values
w_1, ..., w_m lie in the redundancy range of the variables
W_1, ..., W_m for the route in question if the following
counterfactual is true: if it were the case that W_1 = w_1, ...,
W_m = w_m, then it would be the case that Y_1 = y_1, ..., Y_n = y_n,
Z = z. In other words, we can set the values of the variables W_i to
w_i without disturbing the variables (other than possibly X) on the
route ⟨X, Y_1, ..., Y_n, Z⟩. It should be obvious that the actual
values of W_1, ..., W_m will lie in the redundancy range, but there
may be nonactual values of these variables that do as well. Indeed,
cases of symmetric overdetermination arise precisely because nonactual
values of these variables lie in the redundancy range. We may now
provide our generalization of “Act”:


WA: The route ⟨X, Y_1, ..., Y_n, Z⟩ is weakly active relative to
⟨𝒱, ℰ⟩ if and only if there exists a set (possibly empty) of variables
{W_1, ..., W_m} in 𝒱 \ {X, Y_1, ..., Y_n, Z}, and values w_1, ..., w_m
that lie within the redundancy range of these variables for this
route, such that Z depends counterfactually upon X within the new
system of equations ℰ′ constructed from ℰ as follows: for each W_i,
replace the equation for W_i with a new equation that sets W_i equal
to w_i.³⁰


“Act” characterizes a special case where each W_i lies on a route from
X to Z and is set to its actual value. The reader may verify that WA
is met in cases of symmetric overdetermination. I fully grant that WA
is less intuitive and less well-motivated than “Act”. Since none of
our central examples involves symmetric overdetermination, I shall
continue to use “Act” in what follows.³¹
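
The verification left to the reader can be carried out by brute
force. The following sketch is one way to do so, under assumptions:
the helper names are hypothetical, the overdetermination model
(X = 1, Y = 1, Z = X ∨ Y) is a minimal instance chosen for
illustration, and the search simply enumerates every subset of
off-route variables and every assignment of values to it.

```python
from itertools import combinations, product


def solve(equations, order, fixed=None):
    """Solve the system, overriding the equations of variables in `fixed`."""
    fixed = fixed or {}
    values = {}
    for var in order:
        values[var] = fixed[var] if var in fixed else equations[var](values)
    return values


def depends(equations, order, domains, x, z, fixed):
    """Counterfactual dependence of z on x within the system modified by `fixed`."""
    base = solve(equations, order, fixed)
    return any(
        solve(equations, order, {**fixed, x: alt})[z] != base[z]
        for alt in domains[x]
        if alt != base[x]
    )


def weakly_active(equations, order, domains, route):
    """Search for off-route variables and values in the redundancy range
    under which the last variable of the route depends on the first."""
    x, z = route[0], route[-1]
    actual = solve(equations, order)
    off_route = [v for v in order if v not in route]
    for size in range(len(off_route) + 1):
        for ws in combinations(off_route, size):
            for vals in product(*(domains[w] for w in ws)):
                setting = dict(zip(ws, vals))
                trial = solve(equations, order, setting)
                # Redundancy range: on-route variables other than X keep
                # their actual values under this setting.
                in_range = all(trial[v] == actual[v] for v in route[1:])
                if in_range and depends(equations, order, domains, x, z, setting):
                    return True
    return False


binary = (0, 1)

# Symmetric overdetermination (illustrative instance): X = 1, Y = 1, Z = X or Y.
over = {"X": lambda v: 1, "Y": lambda v: 1, "Z": lambda v: v["X"] | v["Y"]}
over_order = ["X", "Y", "Z"]
over_domains = {"X": binary, "Y": binary, "Z": binary}

plain_dependence = depends(over, over_order, over_domains, "X", "Z", {})
wa_met = weakly_active(over, over_order, over_domains, ("X", "Z"))

# The "Boulder" equations from section VIII: F = 1, D = F, S = (not F) or D.
boulder = {
    "F": lambda v: 1,
    "D": lambda v: v["F"],
    "S": lambda v: (1 - v["F"]) | v["D"],
}
b_order = ["F", "D", "S"]
b_domains = {"F": binary, "D": binary, "S": binary}

boulder_direct = weakly_active(boulder, b_order, b_domains, ("F", "S"))
boulder_indirect = weakly_active(boulder, b_order, b_domains, ("F", "D", "S"))
```

In the overdetermination model, setting Y to the nonactual value 0
leaves Z at 1, so 0 lies in the redundancy range, and under that
setting Z depends on X. In the “Boulder” model no such setting exists
for either route.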


VII. “DOG BITE” REVISITED
“Dog Bite” is represented graphically in figure 4. The interpretation
of the variables is as follows: E = 0 or 1 depending on whether the
explosion occurs; P = 0, 1, or 2, depending on whether Terrorist does
not push the detonator at noon, pushes it with his right hand, or
pushes it with his left hand (respectively); and D = 0 or 1 depending
on whether the dog bites his right hand or not, shortly before noon.
Note that the variable P is not binary: it does not merely represent
whether he pushes the detonator with his left hand or not, but
represents whether he pushes the detonator at all, and if so with
which hand. The structural equations for this system are:


DB    D = 1, P = 1 + D, E = min(P, 1)


30 Note the similarity to the definition of actual cause in Pearl and
Halpern.
31 The reader may verify that WA is not met in either “Dog Bite” or
“Boulder.”


In words: the dog does bite Terrorist’s right hand shortly before
noon; Terrorist will push the detonator with his right hand at noon
unless the dog bites his right hand, in which case he will push the
detonator with his left hand; and the bomb will explode just in case
he pushes the detonator with his right or left hand at noon. In fact,
the values of these variables are D = 1, P = 2, E = 1. Suppose the dog
had not bitten his hand. Then we set D = 0, yielding the values P = 1
and E = 1. That is, if the dog had not bitten his right hand,
Terrorist would have pushed the detonator with his right hand at noon
and the bomb would still have exploded. This system of equations
yields the intuitively correct result that the explosion does not
counterfactually depend upon the dog bite.
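
The values just computed can be reproduced with a few lines of code.
The sketch is illustrative; the equation for E is coded from the
verbal gloss above (the bomb explodes just in case Terrorist pushes
the detonator with either hand), and the function name is
hypothetical.

```python
def solve_dog_bite(d=None):
    """Solve the "Dog Bite" equations, optionally overriding D."""
    D = 1 if d is None else d      # the dog bites (1) or does not (0)
    P = 1 + D                      # 1 = right-hand push, 2 = left-hand push
    E = 1 if P >= 1 else 0         # explosion iff the detonator is pushed
    return {"D": D, "P": P, "E": E}


actual = solve_dog_bite()
no_bite = solve_dog_bite(d=0)

# E depends counterfactually on D only if the two solutions disagree on E.
explosion_depends_on_bite = actual["E"] != no_bite["E"]
```

The value of P changes under the intervention while the value of E
does not, which is why the single route ⟨D, P, E⟩ is not active.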

The dog bite is not a cause of the explosion: there is only one route
from D to E, namely ⟨D, P, E⟩, and that route is not causally active.
In disanalogy with “Backup,” the failure of E to counterfactually
depend upon D is not due to cancellation along different routes, but
rather to what we might call a failure of composition. While the
function f_P from the variable D to P is nontrivial, and the function
f_E from P to E is nontrivial, the composite function f_E ∘ f_P from D
to E is trivial.
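
The failure of composition can be exhibited by tabulating the three
functions. This is an illustrative sketch: “nontrivial” is read here
as “takes more than one value over its domain”, which is an
interpretive assumption, and f_E is coded from the verbal description
of when the bomb explodes.

```python
def f_P(d):
    """P as a function of D: right-hand push (1) unless bitten, else left (2)."""
    return 1 + d


def f_E(p):
    """E as a function of P: explosion iff the detonator is pushed at all."""
    return 1 if p >= 1 else 0


def is_nontrivial(f, domain):
    """True if f takes at least two distinct values over `domain`."""
    return len({f(value) for value in domain}) > 1


D_VALUES = (0, 1)
P_VALUES = (0, 1, 2)

f_P_nontrivial = is_nontrivial(f_P, D_VALUES)
f_E_nontrivial = is_nontrivial(f_E, P_VALUES)
composite_nontrivial = is_nontrivial(lambda d: f_E(f_P(d)), D_VALUES)
```

The composite is constant because the values of P reachable from D
(1 and 2) are exactly those on which f_E does not vary.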

L. A. Paul³² argues that “Dog Bite” (or rather a case structurally
identical to it) does not constitute a violation of the transitivity
of causation. She maintains that the effect of the dog bite is
different from the cause of the explosion: one is a push qua thing
done with left hand, the other a push qua push of detonator button. I
take this to be a proposal about how to use causal language to
describe the pattern of dependence captured by DB. Perhaps it is
possible to regiment our causal language in such a way that failures
of composition need not be described by triples of the form
‘a causes b’, ‘b causes c’, and ‘a does not cause c’. But since the
transitivity of causation is not a principle of which we have
independent need (section V), and since there are counterexamples to
the transitivity of causation which do not rest on failures of
composition (section VIII), the desire to preserve the transitivity of
causation provides a poor motive for so regimenting our language.

32 “Aspect Causation,” this JOURNAL, XCVII, 4 (April 2000): 235-56.

We have shown that there is no active route from D to E in the
model ⟨{D, P, E}, DB⟩. Perhaps there is another choice of ⟨𝒱, ℰ⟩ that
is equally appropriate? Suppose, for example, that we represent the
situation using the structure depicted in figure 5. Here P_l = 0 or 1
depending upon whether Terrorist pushes the button with his left
hand at noon, and P_r = 0 or 1 depending upon whether Terrorist
pushes the button with his right hand at noon. The corresponding set
of structural equations would be:


DB′    D = 1, P_l = D, P_r = ¬D, E = P_l ∨ P_r


This structure gives us two routes from D to E. Holding P_r fixed at
its actual value of 0, E does counterfactually depend upon D. This
implies that there is an active route, ⟨D, P_l, E⟩, from D to E.
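
What the alternative model entails can be computed directly. The
sketch below is illustrative and only reports what follows from the
equations D = 1, P_l = D, P_r = ¬D, E = P_l ∨ P_r; the function name
and override parameters are hypothetical.

```python
def solve_two_hand_model(d=None, p_r=None):
    """Solve D = 1, P_l = D, P_r = not D, E = P_l or P_r, with overrides."""
    D = 1 if d is None else d
    P_l = D
    P_r = (1 - D) if p_r is None else p_r
    E = P_l | P_r
    return {"D": D, "P_l": P_l, "P_r": P_r, "E": E}


actual = solve_two_hand_model()

# Simple antecedent: no bite, and P_r is free to respond.
no_bite = solve_two_hand_model(d=0)

# ENF antecedent: no bite, with P_r held at its actual value of 0.
no_bite_right_held = solve_two_hand_model(d=0, p_r=actual["P_r"])
```

Within this model, E keeps the value 1 under the simple antecedent but
falls to 0 once P_r is held fixed, so the model counts the route
through P_l as active.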

What makes this an inappropriate model for “Dog Bite”?³³ One
minimum criterion of adequacy for a model is that it entail only true
counterfactuals. As it turns out, the very counterfactual that reveals
the active route ⟨D, P_l, E⟩ in this model is false, or at best
indeterminate. That ENF counterfactual is: given that Terrorist did
not push the detonator button with his right hand, if the dog had not
bitten Terrorist’s right hand, then (he would not have pushed the
button with his left hand either, and) the bomb would not have
exploded.³⁴ Given that he did not push the button with his right hand,
why should the dog bite make any difference to whether he pushes it
with the left? He wanted the bomb to explode; would he not push the
button with his left hand regardless of whether the dog bit his right?
At any rate, it seems wrong to say that he would definitely not have
pushed the button with his left hand.

33 The following discussion has benefited especially from comments by
Hausman and Sober.

34 Note that DB does not entail that this counterfactual is true or
that it is false. Substituting P ≠ 1 does not yield a determinate
solution for either P or E.

[Figure 6: arrows from D to P_r, from P_r to P_l, and from P_r and P_l to E]

The problem with figure 5 and DB′ is that they represent the dog
bite as having two distinct effects: it prevented Terrorist from
pushing the button with his right hand, and caused him to push it with
his left. These effects are not genuinely distinct: the dog bite
caused Terrorist to push the button with his left hand only insofar as
it prevented him from pushing it with his right. This intuitive
distinction is made precise using ENF counterfactuals: Terrorist’s
pushing the button with his left hand ceases to depend counterfactually
upon the dog bite when we specify whether or not he pushed the button
with his right hand.

We might alternately represent “Dog Bite” by DB″ and figure 6:


DB″    D = 1, P_r = ¬D, P_l = ¬P_r, E = P_l ∨ P_r


This captures the idea that the dog bite causes Terrorist to push the
button with his left hand by preventing him from pushing with his
right.

Figure 6 and DB″ do not entail the troublesome ENF counterfactuals
discussed above, but they are inappropriate for a different reason.
It was specified in the example that Terrorist had to press the
detonator at noon in order for the bomb to explode. Accordingly, the
variables P_l and P_r must represent whether or not Terrorist presses
the button with the relevant hand at noon, or else the equation for E
will be false. It is thus inappropriate to think of these variables as
standing in a causal relationship: Terrorist pushed the button at noon
with his left hand rather than his right, but his failure to push the
button with his right hand did not cause him to push the button with
his left hand. The two events (or rather, the one omission and the
one event) fail to be distinct in the way required for the one to be a
cause of the other. Invoking the terminology of Lewis’s new theory,
pushing with the left hand and pushing with the right are alterations
of the same event. We capture this idea by representing the two as
distinct values of a single variable, as we did in figure 4 and DB.

Figure 6 and DB″ would be an appropriate representation for a
slightly different case. Suppose that there is a window of time during
which Terrorist can push the button. Suppose, moreover, that his
initial instinct is to push the button with his right hand, and only
after he finds himself unable to do so will he push the button with
his left. In this variant of the story, Terrorist’s failure to push
the button with his right hand does indeed cause him to push the
button with his left hand a moment later. In this version of the
example, we still have the intuition that the dog bite did not cause
the explosion, and I owe an account of this. It turns out that this
structure (or rather a crucial part of it) is isomorphic to that of
“Boulder,” which I discuss in the next section.

Consider one final alternative representation. Suppose we replace
D with a three-valued variable: the dog might do nothing, bite
Terrorist’s right hand, or maul Terrorist severely. If the dog mauls
Terrorist, he will not push the button (being totally incapacitated)
and the bomb will not explode. Now the route ⟨D, P, E⟩ in figure 4
will be active: whether the bomb explodes will depend upon what the
dog does. But this is as it should be: if we are willing to take
seriously the possibility that the dog might have mauled Terrorist
severely, we might well judge that the dog’s (merely) biting his right
hand (rather than mauling him) is a cause of the explosion.³⁵ Assume,
however, that this is not a possibility that we wish to take seriously
(did you consider this possibility before I mentioned it?); then this
new model is inappropriate for pragmatic reasons. This mirrors Lewis’s
claim that in order for the event c to influence e, the alterations of
c that yield different alterations of e must be “not-too-distant.”

Of course, I cannot rule out every rival model of “Dog Bite”. I
maintain, however, that of those which spring to mind, only the
model captured by figure 4 and DB is appropriate. In this model,
there is no active route from the dog bite to the explosion. This
captures our intuitive judgment that the dog bite is not a cause of
the explosion. In the course of discussing this example, we have
illustrated three different ways in which a model might fail to be
appropriate: it might license counterfactuals that are not true, it
might posit causal connections between events that are not distinct,
or it might encompass possibilities that are too distant from
actuality to take seriously.

35 This is the sort of case where the contrastive approach that I have
developed elsewhere (see the references in footnote 26 above) is
helpful.

[Figure 7: arrows from F to D, from D to S, and from F to S]


VIII. “BOULDER” REVISITED
Let us set up a model for “Boulder” (figure 7). The variable F = 0 or
1 depending upon whether the boulder falls, D = 0 or 1 depending
upon whether Hiker ducks, and S = 0 or 1 depending upon whether
Hiker survives. The structural equations are:


BO    F = 1, D = F, S = ¬F ∨ D


(To see the isomorphism with DB″, identify P_r with ¬F, P_l with D,
and E with S.)

In this model, there are no active routes from F to S. It is
relatively easy to see that the direct route ⟨F, S⟩ is inactive.
Holding D fixed at its actual value of 1, there is no counterfactual
dependence of S on F: holding fixed that Hiker ducked, he would have
survived if the boulder had not fallen. The more important issue,
however, is whether the indirect route ⟨F, D, S⟩ is active. It is
along this route that we have a chain of counterfactual dependence,
and hence causation on Lewis’s account. There are no intermediates
between F and S that do not lie along this route, and hence nothing to
hold fixed in evaluating whether this route is active. Since S does
not depend on F, the route is inactive. Intuitively, the falling
boulder does not save Hiker’s life because without it, Hiker’s life
would not have been endangered in the first place. This is just what
is indicated by the absence of an active route from F to S: there is
no scenario in which the boulder does not fall and Hiker does not
survive.
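
Both routes can be evaluated from the equations F = 1, D = F,
S = ¬F ∨ D. The sketch below is illustrative; the function name and
its override parameters are hypothetical.

```python
def solve_boulder(f=None, d=None):
    """Solve F = 1, D = F, S = (not F) or D, overriding F and/or D when given."""
    F = 1 if f is None else f
    D = F if d is None else d
    S = (1 - F) | D
    return {"F": F, "D": D, "S": S}


actual = solve_boulder()

# Direct route <F, S>: change F while holding D at its actual value.
direct = solve_boulder(f=0, d=actual["D"])

# Indirect route <F, D, S>: no off-route intermediates, so nothing is held fixed.
indirect = solve_boulder(f=0)

direct_route_active = direct["S"] != actual["S"]
indirect_route_active = indirect["S"] != actual["S"]

# With the boulder absent, Hiker survives whatever D is.
survives_without_boulder = all(solve_boulder(f=0, d=d)["S"] == 1 for d in (0, 1))
```

Neither intervention changes the value of S, and the final check
confirms that S = 1 whenever F = 0, regardless of D.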

Note that while the causal diagram for “Boulder” is isomorphic to
that in “Backup” (figure 3), the structural equations are not
isomorphic. Thus “Backup,” “Dog Bite,” and “Boulder” all exhibit
different causal structures: counterfactual dependence fails for
different reasons in each case. By contrast, Lewis³⁶ claims that all
of the (putative) counterexamples to transitivity have a common
structure. This is not surprising: it is only by considering ENF
counterfactuals that the difference in structure is revealed. If we
consider only counterfactuals with single antecedents, as Lewis does,
then all three cases have the same structure: there is a chain of
counterfactual dependence, but no dependence of the last event on the
first.

There is a sense in which the inactivity of the route ⟨F, D, S⟩ rests
on a loophole in the definition of an active route. The guiding idea
behind that definition was to isolate the influence of one variable on
another along a given route by factoring out any counteracting
influences along other routes. This was accomplished by holding
fixed the values of intermediate variables along those other routes.
This technique will not work, however, when one of the other routes
in question is direct: a direct route has no intermediate variables,
so its role cannot be factored out. I maintain that this is no
weakness in the definition; rather, it is the nature of direct routes
that they are too intimately bound up with other routes to be
separated from them. Any effect the falling boulder may have made on
Hiker’s survival by causing him to duck cannot be conceptually severed
from the risk that the falling boulder posed to him in the first
place. Thus “Boulder” is analogous to “Backup” in that S fails to
depend counterfactually upon F because of cancellation along two
different routes, but “Boulder” is unlike “Backup” in that an active
route cannot be isolated.

Is it possible to interpolate a variable along the route (F, S)? If so,
then it may be possible to hold that variable fixed, and thus isolate the
route (F, D, S) and show it to be active. This can be done, but it turns
out to be much more difficult than it might first appear. The trick is
to find a variable that does not also lie along the route (F, D, S). The
possibility of doing this rests upon the contingent fact that it takes
Hiker a finite amount of time to react to the boulder and get into a
safe position. There will be a point on the boulder's trajectory (let us
say one meter from Hiker's head) such that by the time the boulder
reaches that point, it is too late for Hiker to duck if he has not done
so already. Let B = 1 represent the boulder's presence at this point,
and B = 0 its absence. Let us add the clarifying remark that D = 0 or
1 according to whether Hiker ducks at the appropriate time, that is,
sufficiently early to avoid being hit by the boulder. The value of D tells us
nothing about whether Hiker attempts (too late) to duck in response


36 “Causation as Influence,” pp. 194-95.
Figure 8


to the boulder when it is only one meter from his head. Note how we
have carefully chosen B so that B and D do not affect each other.
Here is the new set of structural equations:


BO″: F = 1, D = F, B = F, S = ¬B ∨ D


The corresponding causal graph is shown in figure 8. In fact, B took
the value 1. Holding this fixed, we must determine whether Hiker's
survival depends counterfactually upon the boulder's fall. Setting F =
0 and B = 1, we get D = 0, and thus S = 0. Relative to the system of
structural equations BO″, the route (F, D, S) is indeed active.
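
The counterfactual evaluated above can be checked mechanically. What follows is a minimal sketch, assuming binary variables and the four equations as read from the text (F = 1, D = F, B = F, S = ¬B ∨ D). The function name `solve` and its `interventions` argument are illustrative choices, not part of the original account; the sketch also assumes that a counterfactual with several antecedents is evaluated by fixing those variables and recomputing everything downstream.

```python
def solve(interventions=None):
    """Evaluate the "Boulder" equations that include the extra variable B.

    `interventions` maps a variable name to a value that is held fixed,
    overriding that variable's own equation (an assumption of this sketch).
    """
    fixed = dict(interventions or {})
    v = {}
    v["F"] = fixed.get("F", 1)        # the boulder falls
    v["D"] = fixed.get("D", v["F"])   # Hiker ducks in time iff the boulder falls
    v["B"] = fixed.get("B", v["F"])   # boulder one meter from Hiker's head iff it falls
    # Hiker survives iff the boulder is not at that point, or he has ducked
    v["S"] = fixed.get("S", int((not v["B"]) or v["D"]))
    return v


actual = solve()                          # what in fact happened
counterfactual = solve({"F": 0, "B": 1})  # no fall, yet boulder present at one meter

print("actual:        ", actual)
print("counterfactual:", counterfactual)
```

With B held at its actual value, changing F changes S, which is the sense in which the route through D counts as active relative to this system of equations.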

Must we then conclude that the boulder's fall did save Hiker's life
after all? Perhaps this would not be so bad. On my account, unlike
Lewis’s, it is at least easy to explain why our pretheoretic intuition
yields the “wrong” answer in this case. The interpolated variable B is
not easy to find, and the ENF counterfactual that reveals the active
causal route from F to S is not at all intuitive. In words, the relevant
piece of counterfactual reasoning would go as follows: suppose that
the boulder had been present at a point one meter from Hiker’s head
and flying toward him, and suppose moreover that it had never fallen
in the first place. Since it never fell, Hiker would not have seen it
coming and would not have ducked; since it would have been there,
one meter from his exposed head, it would have hit him and he
would not have survived. This counterfactual reasoning is correct, but
bizarre. If the boulder never fell, how did it get to be there, one meter
from Hiker's head? We are to imagine, presumably, that the boulder
was mysteriously and instantaneously transported to a position
immediately in front of Hiker's head. This is the sort of counterfactual
reasoning that only trained philosophers engage in; unaided intuition
is not to be faulted for failing to “see” the relevant ENF
counterfactual.




I believe, however, that it is possible to give a less apologetic
response.³⁷ Figure 8 and BO″, while more complete than figure 7
and BO, do not constitute an appropriate representation of “Boulder.”
As we have noted, our causal judgments depend, in part, upon
which unactualized possibilities we are willing to take seriously, and
which we consider too remote. The variables we choose to include in
a causal model should reflect these concerns. We included the variables
F and D in our original model of “Boulder.” This choice reflects
our willingness to take seriously the possibility that the boulder does
not fall, and the possibility that Hiker does not duck. Moreover, it
reflects our willingness to take seriously the possibility that Hiker does not
duck even though the boulder falls. That is, it reflects our willingness to
view Hiker’s life as being at risk. Why are we willing to take this possibility
seriously, even though it is stipulated that Hiker is determined
to duck if the boulder falls? Plausibly, it is because we recognize the
nomic connection between the boulder’s fall and Hiker’s duck to be
highly contingent upon unspecified details of the case. Hiker might
not have ducked if he had been looking in the wrong direction, if his
reactions were slowed by tired muscles, if he were less prone to
reacting coolly in times of crisis, and so on. It is not necessary,
however, that we have any one of these explanations firmly in mind
in order to take seriously the possibility that Hiker does not duck
when the boulder falls.

When we exclude the variable B from our model, it is not because
we are unwilling to take seriously the possibility that the boulder was
not present at that point (one meter from Hiker's head). We take
that possibility seriously when we entertain the possibility that the
boulder does not fall in the first place. Rather, we are not willing to
take seriously the possibility that the boulder (or a boulder of similar
size and shape) comes to be in that position even though the boulder does
not fall in the first place. This possibility is just too far-fetched. (Did you
consider this possibility before I mentioned it?) Perhaps one could
tell a story that would lead us to take this possibility seriously:
perhaps Hiker has inadvertently walked in front of a boulder
launcher that is carefully camouflaged against the hillside.³⁸ But in
just such a case, we should take the original causal claim seriously: by
causing Hiker to duck in plenty of time, the fall of the boulder down
the hillside does indeed save Hiker's life.


37 The following paragraphs owe a great deal to discussion with Pearl.

38 To get the causal structure to come out right, more details would be needed.
In particular, it would have to be the case that on this occasion, the launching of a
boulder is triggered by the original boulder falling down the hillside.
IX. CONCLUSION

If one subscribes to the thesis that causation is transitive, there are
benefits: one can explain how there can be causation without
counterfactual dependence in cases of preemption such as “Backup.” But
there are costs as well: the defender of transitivity is committed to
unintuitive causal claims in cases like “Dog Bite” and “Boulder.” The
benefits are not worth the costs: they can be had cost-free. The use of
ENF counterfactuals allows us to accommodate causation without
counterfactual dependence in cases of preemption, without committing
us to the counterintuitive consequences of transitivity. The use of
ENF counterfactuals is independently motivated by the techniques
employed in causal modeling.

In addition to reproducing our intuitions in standard test cases, the
approach recommended here has a number of further advantages. It
reproduces our intuitive causal judgments in a very natural way: the
counterfactuals appealed to are psychologically natural counterfactuals
rather than philosophically technical ones. Moreover, it reproduces
our causal judgments without introducing events or variables
beyond those explicitly presented in the various scenarios (such as
intermediate stages in the trajectory of the bullet in “Backup” or the
boulder in “Boulder”). In short, it reproduces our intuitive judgments
using only those resources that are available to unaided intuition.
The approach recommended here allows for an elegant
representation of patterns of counterfactual dependence. We can
explicitly represent the structural differences between superficially
similar cases. These representations also allow us to make explicit
some of our underlying assumptions about a particular case (such as
that certain possibilities are too far-fetched to be taken seriously).

Perhaps there is still room for the defender of transitivity to
maneuver. But when there is an alternative account with such striking
advantages, why bother?

CHRISTOPHER HITCHCOCK 

California Institute of Technology
PARADOXES OF FAIR DIVISION*


Paradoxes, if they do not define a field, render its problems
intriguing and often perplexing, especially insofar as the
paradoxes remain unresolved. Voting theory, for example, has
been greatly stimulated by the Condorcet paradox, which is the
discovery by the Marquis de Condorcet¹ that there may be no
alternative that is preferred by a majority to every other alternative,
producing so-called cyclical majorities. Its modern extension and
generalization is Arrow's² theorem, which says, roughly speaking, that
a certain set of reasonable conditions for aggregating individuals’
preferences into some social choice is inconsistent.

In the last fifty years, hundreds of books and thousands of articles
have been written about these and related social-choice paradoxes
and theorems, as well as their ramifications for voting and democracy.
Hannu Nurmi³ provides a good survey and classification of voting
paradoxes and also offers advice on “how to deal with them.”

There is also an enormous literature on fairness, justice, and
equality, and numerous suggestions on how to rectify the absence of
these properties or attenuate their erosion. But paradoxes do not
frame the study of fairness in the same way they have inspired
social-choice theory.

To be sure, the notion that justice and order may be incompatible,
or that maximin justice in John Rawls’s⁴ sense undercuts the
motivation of individuals to strive to do their best, underscores the possible
trade-offs in making societies more just or egalitarian. For example,
an egalitarian society may require strictures on free choice to ward off
anarchy; rewarding the worst-off members of a society may deaden
competition among the most able if their added value is siphoned off
to others.

Obstacles like these which stand in the way of creating a just society
are hardly surprising. They are not paradoxes in the strong sense of


* Brams acknowledges the support of the C. V. Starr Center for Applied
Economics at New York University. Research by Edelman was done while he was in the
School of Mathematics, University of Minnesota. We thank Dorothea Herreiner for
valuable comments.
constituting a logical contradiction between equally valid principles.
Here we use paradox in a weaker sense: as a conflict among fairness
conditions that one might expect to be compatible. Because we are
surprised to discover this conflict, it is “nonobvious,” as one of us
labeled a collection of paradoxes he assembled about politics.⁵

The fair-division paradoxes we present here all concern how to
divide up a set of indivisible items among two or more players. In
some paradoxes, we assume the players can do no more than rank the
items from best to worst; in others, we assume they can, in addition,
indicate preferences over subsets, or packages, of items. While our
framework is generally an ordinalist one, we do admit one
cardinalization of ranks, based on the Borda count used in voting, to facilitate
certain comparisons, particularly those involving allocations with
different numbers of items.

The main criteria we invoke are efficiency (there is no other division
better for everybody, or better for some players and not worse for the
others) and envy-freeness (each player likes her allocation at least as
much as those which the other players receive, so she does not envy
anybody else). But because efficiency, by itself, is not a criterion of
fairness (an efficient allocation could be one in which one player gets
everything and the others nothing), we also consider other criteria of
fairness besides envy-freeness, including two different measures of
how a worst-off player fares (maximin and Borda maximin), which we
contrast with a utilitarian notion of overall welfare (Borda total
score). What we rule out, besides the splitting of items, is the
possibility of randomizing among different allocations, which is another
way that has been proposed for “smoothing out” inequalities caused
by the indivisibility of items.⁶


5 Brams, Paradoxes in Politics: An Introduction to the Nonobvious in Political Science
(New York: Free Press, 1976); see also Fishburn, “Paradoxes of Voting,” American
Political Science Review, LXVIII, 2 (June 1974): 537-46; and Fishburn and Brams,
“Paradoxes of Preferential Voting,” Mathematics Magazine, LVI, 4 (September 1983):
207-14.

6 Fair-division procedures that allow for the splitting of (divisible) goods or the
sharing of (indivisible) goods, possibly based on a randomization process that
determines time shares, are discussed in, among other places, John Broome,
Weighing Goods: Equality, Uncertainty and Time (Cambridge: Blackwell, 1991); H.
Peyton Young, Equity in Theory and Practice (Princeton: University Press, 1994);
Hervé Moulin, Cooperative Economics: A Game-Theoretic Introduction (Princeton:
University Press, 1995); Brams and Alan D. Taylor, Fair Division: From Cake-Cutting to
Dispute Resolution (New York: Cambridge, 1996); Jack Robertson and William Webb,
Cake-Cutting Algorithms: Be Fair If You Can (Natick, MA: A K Peters, 1998); and
Brams and Taylor, The Win-Win Solution: Guaranteeing Fair Shares to Everybody (New
York: Norton, 1999).
Our paradoxes demonstrate the opportunities as well as the
limitations of fair division. Thus, for example, while the only division of
items in which one player never envies the allocation of another may
be nonexistent or inefficient, we note that there is always an efficient
and envy-free division for two players, even when they rank all items
the same, as long as they do not rank all subsets of items the same.
We also show that fair division may entail an unequal division of the
items.

We divide the paradoxes into three categories:


(1) The conflict between efficiency and envy-freeness (paradoxes 1 
and 2) 

(2) The failure of a unique efficient and envy-free division to satisfy 
other criteria (paradoxes 3 and 4) 

(3) The desirability, on occasion, of dividing items unequally (para- 
doxes 5, 6, 7, 8) 


While the paradoxes highlight difficulties in creating “fair shares”
for everybody, they by no means render the task impossible. Rather,
they show how dependent fair division is on the fairness criteria one
deems important and the trade-offs one considers acceptable. Put
another way, achieving fairness requires some consensus on the
ground rules (that is, criteria) and some delicacy in applying them
(to facilitate trade-offs when the criteria conflict).

We mention three technical points before we proceed to specific
examples. First, we assume that players cannot compensate each
other with side payments: the division is only of the indivisible items.
Second, all players have positive values for every item. Third, a player
prefers one set S of items to a different set T if (i) S has at least as many
items as T and (ii) for every item t in T and not in S, there is a distinct item
s in S and not in T that the player prefers to t. For example, if a player
ranks items 1 through 4 in order of decreasing preference 1234, we
assume that she prefers


• the set {1,2} to {2,3}, because {1} is preferred to {3}; and
• the set {1,3} to {2,4}, because {1} is preferred to {2} and {3} is preferred
to {4};


whereas the comparison between sets {1,4} and {2,3} could go either
way.
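
The third technical point amounts to a small matching test, which can be sketched in code. This is only an illustration under stated assumptions: rankings are written as strings with the best item first, the function name `prefers` is hypothetical, and condition (i) is read as "at least as many items". Pairing the k-th best item held only in S with the k-th best item held only in T is sufficient to decide whether the required distinct better items exist.

```python
def prefers(ranking, s, t):
    """True if the rule in the text guarantees that set s is preferred to set t.

    ranking: items in decreasing order of preference, e.g. "1234".
    s, t:    iterables of items, e.g. "12" or {"1", "2"}.
    """
    pos = {item: i for i, item in enumerate(ranking)}  # 0 = best
    s, t = set(s), set(t)
    if s == t or len(s) < len(t):
        return False
    only_s = sorted(pos[x] for x in s - t)  # positions of items held only in s
    only_t = sorted(pos[x] for x in t - s)  # positions of items held only in t
    # each item only in t must be beaten by a distinct item only in s;
    # comparing k-th best with k-th best is enough to decide this
    return all(a < b for a, b in zip(only_s, only_t))


print(prefers("1234", "12", "23"))  # {1,2} versus {2,3}
print(prefers("1234", "13", "24"))  # {1,3} versus {2,4}
print(prefers("1234", "14", "23"))  # {1,4} versus {2,3}
print(prefers("1234", "23", "14"))  # {2,3} versus {1,4}
```

When the function returns False in both directions, as for {1,4} and {2,3}, the ranking alone does not settle the comparison.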


I. EFFICIENCY AND ENVY-FREENESS: THEY MAY BE INCOMPATIBLE
Paradox 1: a unique envy-free division may be inefficient. Suppose there is
a set of three players, {A, B, C}, that must divide a set of six indivisible
items, {1, 2, 3, 4, 5, 6}. Assume the players strictly rank the items from
best to worst as follows:




Example I 

A 123456 
B 432156
C 512634 


The unique envy-free allocation to (A, B, C) is ({1,3}, {2,4}, {5,6}), or for
simplicity (13, 24, 56), whereby A and B get their best and third-best
items, and C gets her best and fourth-best items. Clearly, A prefers her
allocation to that of B (which are A’s second-best and fourth-best
items) and that of C (which are A's two worst items). Likewise, B and
C prefer their allocations to those of the other two players.
Consequently, the division (13, 24, 56) is envy-free: all players prefer their
allocations to those of the other two players, so no player is envious
of any other.

Compare this division with (12, 34, 56), whereby A and B receive
their two best items, and C receives, as before, her best and
fourth-best items. This division Pareto-dominates (13, 24, 56), because two of
the three players (A and B) prefer the former allocation, whereas
both allocations give player C the same two items (56).

It is easy to see that (12, 34, 56) is Pareto-optimal, or efficient: no
player can do better with some other division without some other
player or players doing worse. This is apparent from the fact that the
only way A or B, who get their two best items, can do better is to
receive an additional item from one of the two other players
(assuming all items have some positive value for the players), but this
will necessarily hurt the player who then receives fewer than her
present two items. Whereas C can do better without receiving a third
item if she receives item 1 or 2 in place of item 6, this substitution
would necessarily hurt A, who will do worse if she receives item 6 for
item 1 or 2.

The problem with efficient allocation (12, 34, 56) is that it is not
assuredly envy-free. In particular, C will envy A’s allocation of 12
(second-best and third-best items for C) if she prefers these two items
to her present allocation of 56 (best and fourth-best items for C). In
the absence of information about C's preferences for subsets of items,
therefore, we cannot say that efficient allocation (12, 34, 56) is
envy-free.⁷


7 Henceforth we shall mean by ‘envy-free’ a division such that, no matter how the
players value subsets of items consistent with their rankings, no player prefers any
other player's allocation to her own. If a division is not envy-free, we call it
envy-possible if a player's allocation may make her envious of another player,
depending on how she values subsets of items, as illustrated by division (12, 34, 56) in the
text. It is envy-ensuring if it causes envy, independent of how the players value subsets
But the real bite of paradox 1 stems from the fact that not only is
inefficient division (13, 24, 56) envy-free, but it is uniquely so: there
is no other division, including an efficient one, that guarantees
envy-freeness. To show this in example I, note first that an envy-free
division must give each player her best item; if not, then a player
might prefer a division, like envy-free division (13, 24, 56) or efficient
division (12, 34, 56), that does give each player her best item,
rendering the division that does not envy-possible or envy-ensuring.
Second, even if each player receives her best item, this allocation
cannot be the only item she receives, because then the player might
envy any player that receives two or more items, whatever these
items are.

By this reasoning, then, the only possible envy-free divisions in
example I are those in which each player receives two items,
including her top choice. It is easy to check that no efficient division is
envy-free.⁸ Similarly, one can check that no inefficient division that
gives each player two items, including her best, is envy-free except
(13, 24, 56), making this division uniquely envy-free.
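
The uniqueness claim for example I can be checked by brute force over all 729 ways of handing out six items to three players. The sketch below rests on an assumption that should be stated plainly: a player is treated as certain not to envy another only when the set-comparison rule from the technical points applies, that is, when she holds at least as many items and her k-th best item beats the other player's k-th best for every k. The helper names are hypothetical.

```python
from itertools import product

RANKINGS = {"A": "123456", "B": "432156", "C": "512634"}  # example I, best first


def never_envies(ranking, own, other):
    """True if `own` is at least as good as the disjoint set `other`
    under every valuation of subsets consistent with `ranking`."""
    pos = {item: i for i, item in enumerate(ranking)}
    if len(own) < len(other):
        return False
    mine = sorted(pos[x] for x in own)
    theirs = sorted(pos[x] for x in other)
    return all(m < t for m, t in zip(mine, theirs))


def is_envy_free(division):
    players = list(RANKINGS)
    return all(
        never_envies(RANKINGS[p], division[p], division[q])
        for p in players
        for q in players
        if p != q
    )


def all_divisions(items="123456"):
    """Yield every assignment of the items to the three players."""
    players = list(RANKINGS)
    for owners in product(players, repeat=len(items)):
        division = {p: "" for p in players}
        for item, owner in zip(items, owners):
            division[owner] += item
        yield division


envy_free = [d for d in all_divisions() if is_envy_free(d)]
print(envy_free)
```

Under these assumptions the search returns a single division, the one written (13, 24, 56) in the text.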

Paradox 2: there may be no envy-free division, even when all players have
different preference rankings. While it is bad enough when the only
envy-free division is inefficient (paradox 1), it seems even worse when
there is no envy-free division. This is trivial to show when players rank
items the same. For example, if two players both prefer item 1 to item
2, then the player that gets item 2 will envy the player that gets
item 1.

In the following example, each of three players has a different
ranking of three items:


Example II 
A 123
B 132
C 213 


There are three efficient divisions in which each player gets exactly
one item: (1, 3, 2), (2, 1, 3), and (3, 1, 2). In each of them at least
one player gets her best item. It is evident that none is





of items. In effect, a division that is envy-possible has the potential to cause envy. By
comparison, an envy-ensuring division always causes envy, and an envy-free division
never causes envy.

8 We previously showed that division (12, 34, 56) is not envy-free. As another
example, consider efficient division (16, 34, 25). Whereas neither B nor C envies
each other or A, A might prefer either B's 34 or C's 25 allocations, making this
division envy-possible.
envy-free, because the player that gets item 1 in each (A or B) will be
envied by at least one of the other two players. For instance, in the
case of the division (2, 1, 3), both A and C will envy B.
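
For divisions of example II in which each player holds exactly one item, envy is fully determined by the rankings, so the pattern can be tabulated directly. This is a minimal sketch with hypothetical helper names; divisions are keyed by the items given to A, B, and C in that order, so "213" stands for the division (2, 1, 3).

```python
from itertools import permutations

RANKINGS = {"A": "123", "B": "132", "C": "213"}  # example II, best item first


def envy_pairs(division):
    """List (envier, envied) pairs when each player holds exactly one item."""
    pairs = []
    for p, ranking in RANKINGS.items():
        for q in RANKINGS:
            # p envies q if q's item comes earlier in p's ranking than p's own
            if p != q and ranking.index(division[q]) < ranking.index(division[p]):
                pairs.append((p, q))
    return pairs


results = {}
for items in permutations("123"):
    division = dict(zip("ABC", items))
    results["".join(items)] = envy_pairs(division)

for name, pairs in results.items():
    print(name, pairs)
```

Every one of the six one-item-each divisions produces at least one envious pair, which is consistent with the claim that example II admits no envy-free division.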

Can an inefficient division be envy-free, as was the case in example
I? It is not hard to see that this situation cannot occur in example II
for the reason given above: the player that gets item 1 will be envied.
But in the case of an inefficient division, “trading up to efficiency”
reduces the amount of envy. For example, consider inefficient division
(2, 3, 1), in which each player receives her second-best choice.
Because A envies C, B envies C, and C envies A, a trade of items 1 and
2 between A and C is possible. It yields efficient division (1, 3, 2), in
which only B envies A.

Besides (2, 3, 1), the other two inefficient divisions, (1, 2, 3) and
(3, 2, 1), also allow for trading up to efficiency. In the first, a trade
of items 2 and 3 between B and C yields efficient division (1, 3, 2); in
the second, a trade of items 1 and 2 between B and C yields efficient
division (3, 1, 2). Three-way trades are also possible. For instance,
starting from inefficient division (3, 2, 1), a three-way trade, whereby
A sends item 3 to B, B sends item 2 to C, and C sends item 1 to A,
yields efficient division (1, 3, 2).

Trading up to efficiency is also possible in example I: by exchanging
items 2 and 3, A and B can turn inefficient division (13, 24, 56)
into efficient division (12, 34, 56). As in example II, however, no
efficient division is envy-free in example I. The difference between
examples I and II is that example II does not admit even an
inefficient envy-free division.


II. UNIQUE EFFICIENT AND ENVY-FREE DIVISIONS: THEIR
INCOMPATIBILITY WITH OTHER CRITERIA


Paradox 3: a unique efficient and envy-free division may lose in voting to an
efficient and envy-possible division. So far we have shown that efficiency
and envy-freeness may part company either by there being no
envy-free division that is also efficient (example I), or no envy-free division
at all (example II). But when these properties coincide, and there is
both an efficient and an envy-free division, it may not be the choice
of a majority of players, as illustrated by the following example:


Example III 

A 123456 
B 562143 
C 365412 


There are three efficient divisions in which (A, B, C) each get two
items: (12, 56, 34), (12, 45, 36), and (14, 25, 36). But only the third
division, (14, 25, 36), is envy-free. Whereas C might prefer B's 56
allocation in the first division, and B might prefer A's 12 allocation in
the second division, no player prefers another player's allocation in
(14, 25, 36).

But observe that both A and B prefer the first division, (12, 56, 34),
to the envy-free third division, (14, 25, 36), because they get their top
two items in the first division; only C gets her top two items in (14, 25,
36). Hence, the first division would defeat the envy-free third division,
(14, 25, 36), by simple majority rule.

The situation is not so clear-cut when we compare the second
division, (12, 45, 36), with the envy-free (14, 25, 36). In fact, there
would be a tie vote: C would be indifferent, because she gets her top
two items, 36, in each division; A would prefer the second division
(top two items versus best and fourth-best items); and B would prefer
the envy-free division, (14, 25, 36) (best and third-best items versus
best and fifth-best items).

Thus, if there were a vote, the unique envy-free division, (14, 25,
36), would lose to the envy-possible division, (12, 56, 34), and it would
tie with the other envy-possible division, (12, 45, 36). If there were
“approval voting,”⁹ and A, B, and C voted only for the divisions that
give each player her two best items, then the envy-free division, (14,
25, 36), would get 1 vote, compared to 2 votes each for both of the
envy-possible divisions, (12, 56, 34) and (12, 45, 36). In sum, players
will choose an envy-possible over the unique envy-free division, (14,
25, 36), in either pairwise comparisons or approval voting.
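
The vote tallies for example III can be reproduced with a few lines of code. This is a sketch under stated assumptions: a player votes for a division only when the set-comparison rule from the technical points says she prefers her bundle in it, she abstains when her bundle is unchanged, and an approval vote goes to every division that gives the voter her own two best items. The names `prefers`, `pairwise_vote`, and `approvals` are hypothetical.

```python
RANKINGS = {"A": "123456", "B": "562143", "C": "365412"}  # example III, best first

FIRST = {"A": "12", "B": "56", "C": "34"}
SECOND = {"A": "12", "B": "45", "C": "36"}
THIRD = {"A": "14", "B": "25", "C": "36"}  # the envy-free division


def prefers(ranking, s, t):
    """True if the set-comparison rule guarantees s is preferred to t."""
    pos = {item: i for i, item in enumerate(ranking)}
    s, t = set(s), set(t)
    if s == t or len(s) < len(t):
        return False
    only_s = sorted(pos[x] for x in s - t)
    only_t = sorted(pos[x] for x in t - s)
    return all(a < b for a, b in zip(only_s, only_t))


def pairwise_vote(div1, div2):
    """Return (votes for div1, votes for div2); indifferent players abstain."""
    votes1 = votes2 = 0
    for player, ranking in RANKINGS.items():
        if prefers(ranking, div1[player], div2[player]):
            votes1 += 1
        elif prefers(ranking, div2[player], div1[player]):
            votes2 += 1
    return votes1, votes2


def approvals(division):
    """Number of players who receive exactly their own two best items."""
    return sum(set(division[p]) == set(r[:2]) for p, r in RANKINGS.items())


print(pairwise_vote(FIRST, THIRD))
print(pairwise_vote(SECOND, THIRD))
print([approvals(d) for d in (FIRST, SECOND, THIRD)])
```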

Paradox 4: neither the Rawlsian maximin criterion nor the Borda
total-score criterion may choose a unique efficient and envy-free division. Besides
using voting to select an efficient division, consider the following
Rawlsian maximin criterion to distinguish among efficient divisions:
choose the division that maximizes the minimum rank of items that
players receive, making a worst-off player as well off as possible. To
illustrate in example III, envy-possible division (12, 45, 36) gives a
fifth-best item to B, whereas each of the two other efficient divisions
gives a player, at worst, a fourth-best item. Between the latter two
divisions, the envy-possible division, (12, 56, 34), is, arguably, better
than the envy-free division, (14, 25, 36), because it gives the other two
players (those which do not get a fourth-best item) their two best

9 Under approval voting, voters can vote for as many alternatives as they like; each
alternative approved of receives one vote, and the alternative with the most votes
wins. See Brams and Fishburn, Approval Voting (Boston: Birkhäuser, 1983).
items, whereas envy-free division (14, 25, 36) does not give B her two
best items.¹⁰

A modified Borda count would also give the nod to the
envy-possible division, (12, 56, 34), compared not only with the envy-free
division, (14, 25, 36), but also with the other envy-possible division,
(12, 45, 36). Awarding 6 points for obtaining a best item, 5 points for
obtaining a second-best item, ..., 1 point for obtaining a worst item in
example III, the latter two divisions give the players a total of 30
points, whereas envy-possible division (12, 56, 34) gives the players a
total of 31 points,¹¹ which we call their Borda total scores and use as a
measure of the overall utility or welfare of the players. Hence an
envy-possible division beats the unique envy-free division, based on
both the maximin criterion and the Borda total-score criterion. (We
shall later apply the Borda count to individual players, asking what
division maximizes the minimum Borda score that any player
receives.)
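
The maximin ranks and Borda total scores quoted for example III follow from simple arithmetic, sketched below. The scoring follows the modified rule in the text (6 points for a best item down to 1 point for a worst item); the helper names `borda` and `worst_rank` are hypothetical.

```python
RANKINGS = {"A": "123456", "B": "562143", "C": "365412"}  # example III, best first

DIVISIONS = {
    "(12, 56, 34)": {"A": "12", "B": "56", "C": "34"},
    "(12, 45, 36)": {"A": "12", "B": "45", "C": "36"},
    "(14, 25, 36)": {"A": "14", "B": "25", "C": "36"},
}


def borda(ranking, items):
    """Modified Borda score: n points for a best item, ..., 1 for a worst."""
    n = len(ranking)
    return sum(n - ranking.index(x) for x in items)


def worst_rank(division):
    """Rank (1 = best) of the lowest-ranked item that any player receives."""
    return max(RANKINGS[p].index(x) + 1 for p in RANKINGS for x in division[p])


totals = {}
worst = {}
for name, division in DIVISIONS.items():
    scores = [borda(RANKINGS[p], division[p]) for p in "ABC"]
    totals[name] = sum(scores)
    worst[name] = worst_rank(division)
    print(name, "scores", scores, "total", totals[name], "worst rank", worst[name])
```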


III. THE DESIRABILITY OF UNEQUAL DIVISIONS (SOMETIMES)
Paradox 5: an unequal division of items may be preferred by all players to an
equal division. In section II, we showed that neither (i) pairwise
comparison voting or approval voting (paradox 3), nor (ii) the
maximin criterion or the Borda total-score criterion (paradox 4),
always selects a unique efficient and envy-free division. In the
following example, there is also a unique efficient and envy-free
division, in which all players receive the same number of items
(henceforth called an equal division), but there may be grounds for
choosing an efficient but unequal envy-possible division.


Example IV 
A 1234
B 2341


10 This might be considered a second-order application of the maximin criterion:
if, for two divisions, players rank the worst item any player receives the same,
consider the player that receives a next-worst item in each, and choose the division
in which this item is ranked higher. This is an example of a lexicographic decision rule,
whereby alternatives are ordered on the basis of a most important criterion; if that
is not determinative, a next-most important criterion is invoked, and so on, to
narrow down the set of feasible alternatives.

11 The standard scoring rules for the Borda count in this 6-item example would
give 5 points to a best item, 4 points to a second-best item, ..., 0 points to a worst
item. We depart slightly from this standard scoring rule to ensure that each player
obtains some positive value for all items, including her worst choice, as assumed
earlier.
It is not difficult to show that (13, 24) is the only efficient and
envy-free division. Two other equal divisions, (12, 34) and (14, 23),
while better for one player and worse for the other, are envy-possible.

The above three equal divisions all give Borda total scores of 12 to
their players. If we eliminate the envy-possible division, (14, 23), on
the grounds that it fails the maximin criterion by giving A her worst
item (item 4), then the comparison reduces to that between envy-free
division, (13, 24), and envy-possible division, (12, 34).

Curiously, it is possible that both A and B prefer the unequal
envy-possible division, (134, 2), to the equal envy-possible division,
(12, 34).¹² Thus, unequal divisions might actually be better for all
players than equal divisions.

Ruling out equal division (12, 34) in such a situation, let us
compare (134, 2) with the envy-free (equal) division (13, 24). Clearly,
(134, 2) is better than (13, 24) for A, but it is worse for B.

This leaves open the question of which of these two divisions,
involving an equal and an unequal division of the items, comes closer
to giving the two players “fair shares.” As the next paradox shows, an
unequal division may actually be more egalitarian, as measured by
Borda scores for individual players, than an equal division.

Paradox 6: an unequal division of items may (i) maximize the minimum
Borda scores of players (Borda maximin) and (ii) maximize the sum of Borda
scores (Borda maxsum). In paradox 5, we showed that an unequal but
envy-possible division of items may compare favorably with an equal
and envy-free division. To make this kind of comparison more
precise, consider the following example:


Example V 

A 123456789
B 312456789
C 412365789 


There are exactly two unequal divisions, (12, 357, 4689) and (12,
3589, 467), that maximize the minimum Borda scores of players,
which are [17, 17, 17] for both divisions.¹³ On the other hand, there
are two equal divisions, (129, 357, 468) and (129, 358, 467), that
maximize the minimum Borda scores of players, which are [18, 17,
16] for the first division and [18, 16, 17] for the second division.
These are all the divisions whose Borda total scores are 51, which, it


12 This is true if A prefers 34 to 2, and B prefers 2 to 34.
13 Henceforth we shall indicate the Borda scores of players [in brackets] to
distinguish them from item allocations (in parentheses).
can be shown, is the maximal sum, or Borda maxsum, among all
possible divisions (equal or unequal).
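
The Borda scores reported for example V, and the maximal sum of 51, can be reproduced as follows. This sketch assumes the same modified scoring as before, here 9 points for a best item down to 1 point for a worst item; the names `borda_scores` and `maxsum` are hypothetical. The maximal sum is computed by giving each item to a player who scores it highest, which is valid because the total is a sum of independent per-item contributions.

```python
RANKINGS = {"A": "123456789", "B": "312456789", "C": "412365789"}  # example V

DIVISIONS = {
    "(12, 357, 4689)": ("12", "357", "4689"),
    "(12, 3589, 467)": ("12", "3589", "467"),
    "(129, 357, 468)": ("129", "357", "468"),
    "(129, 358, 467)": ("129", "358", "467"),
}


def borda_scores(bundles):
    """Borda scores [A, B, C]: 9 points for a best item, ..., 1 for a worst."""
    return [
        sum(9 - RANKINGS[p].index(x) for x in bundle)
        for p, bundle in zip("ABC", bundles)
    ]


scores = {name: borda_scores(bundles) for name, bundles in DIVISIONS.items()}

# Upper bound on any division's total: each item scored by its keenest player.
maxsum = sum(
    max(9 - ranking.index(item) for ranking in RANKINGS.values())
    for item in "123456789"
)

for name, s in scores.items():
    print(name, s, "total", sum(s), "minimum", min(s))
print("Borda maxsum:", maxsum)
```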

Notice that the worst-off player in the two unequal divisions garners
17 points (so does the best-off player, because the Borda scores of all
players are the same), whereas the worst-off player in the two equal
divisions receives fewer points (16). By the maximin criterion, but
now based on Borda scores, the unequal divisions are more
egalitarian. We call this the Borda maximin criterion, which is especially useful
in comparing equal and unequal divisions.¹⁴

None of the four equal or unequal divisions is envy-free: all are
envy-possible or envy-ensuring. Likewise, all four divisions are
“efficient-possible” in the sense that there may be a more efficient
division, but this is not guaranteed. Take, for example, the unequal
division (12, 357, 4689). B or C might prefer A's 12 allocation, just as
A might prefer B's or C's allocation, so a trade could make two, or
even all three, players better off. Unlike our previous examples, in
which divisions called “efficient” were all “efficient-ensuring” (that is,
there were no trades that could improve the lot of all traders, however
players valued subsets of items), this is not the case in example V.

The Borda maximin criterion seems a reasonable one to distinguish
among all efficient-possible and envy-possible divisions. In
example V, it is not only unequal divisions that do best on this criterion,
but these divisions are also Borda maxsum, making them both fair
and utility-maximizing (according to the Borda cardinalization of
utility; more on its limitations, illustrated in example VII, later).

Paradox 7: An unequal division of items may be Borda maxsum but not
Borda maximin. There was no conflict between Borda maxsum and
Borda maximin in example V: two unequal divisions satisfied both
these properties. But as the next example illustrates, this need not be
the case.


Example VI 
A: 1 2 3
B: 1 3 2


14 To see why, consider the two unequal divisions in example IV in which no
player receives a fourth-best item: (1, 234) and (123, 4). To call these divisions
maximin, like equal division (13, 24), in which no player receives a fourth-best
item as well, seems highly questionable, because the player receiving her top three
items in these two divisions can hardly be considered worse off (because she
receives a third-best item) than the player receiving only her top item. Indeed, the
Borda scores of the players in the two unequal divisions, [4, 9] and [9, 2], reveal
how inegalitarian these divisions are, particularly when compared with equal Borda
maximin division (13, 24) with Borda scores of [6, 6].
There are two unequal maxsum divisions, (12, 3) and (2, 13),
whose Borda scores are, respectively, [5, 2] and [2, 5]. Each gives a
Borda total score of 7, and a minimum Borda score of 2 for a player.

By contrast, there are two Borda maximin divisions, (1, 23) and
(23, 1), both of which give Borda scores of [3, 3]. While they give the
players a lower Borda total score (6) than the Borda maxsum
divisions, they give the players a higher minimum score of 3.
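
Because example VI has only three items, the maxsum and maximin divisions can be found by listing all eight assignments. The following is a minimal sketch under the same assumed convention as before (the k-th of n ranked items earns n - k + 1 points); `all_divisions` and the other names are hypothetical helpers, and empty bundles are allowed and score 0.

```python
from itertools import product

# Rankings from example VI, best to worst.
RANKINGS = ["123", "132"]  # player A, player B
ITEMS = "123"


def borda_scores(division):
    """Borda scores, assuming the k-th of n ranked items earns n - k + 1."""
    return [
        sum(len(rank) - rank.index(item) for item in bundle)
        for rank, bundle in zip(RANKINGS, division)
    ]


def all_divisions():
    """Yield every assignment of each item to player 0 (A) or player 1 (B)."""
    for owners in product(range(2), repeat=len(ITEMS)):
        yield tuple(
            "".join(i for i, o in zip(ITEMS, owners) if o == p) for p in range(2)
        )


divisions = list(all_divisions())
best_total = max(sum(borda_scores(d)) for d in divisions)
best_min = max(min(borda_scores(d)) for d in divisions)
maxsum = sorted(d for d in divisions if sum(borda_scores(d)) == best_total)
maximin = sorted(d for d in divisions if min(borda_scores(d)) == best_min)

print("maxsum :", best_total, maxsum)
print("maximin:", best_min, maximin)
```

The two lists do not overlap, which is the conflict between the utilitarian and egalitarian criteria that paradox 7 describes.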

Presumably, the egalitarian would choose one of the two Borda
maximin divisions, whereas the utilitarian would choose one of the
two Borda maxsum divisions. Because there are an odd number of
items to divide in example VI, all the divisions between A and B are
necessarily unequal. But both Borda maximin divisions and Borda
maxsum divisions can be either equal (unlike example VI, the
Borda maximin division in example IV is equal, as discussed in
footnote 14) or unequal (two of the Borda maxsum divisions in
example V are equal and two are unequal). It also turns out that
Borda maxsum and Borda maximin scores can be arbitrarily far
apart.¹⁵

We believe that when Borda maximin and Borda total scores
choose different divisions, Borda maximin generally gives the fairer
division by guaranteeing that the Borda score of the worst-off player
is as great as possible.¹⁶ As we shall show in our final paradox,
however, a Borda maximin division may be quite implausible,
depending on how players value subsets, or it may not be envy-free
when, at the same time, there exists an envy-free division that is
neither maximin nor Borda maximin.

Paradox 8: If there are envy-free divisions, none may be maximin or Borda
maximin. In the following example, there are two players but an odd
number of items, so no equal division of the items is possible.


Example VII 
A: 1 2 3 4 5
B: 1 2 3 4 5


15 Brams, Edelman, and Fishburn, “Fair Division of Indivisible Items”
(unpublished).

16 To be sure, assuming that the differences in ranks are all equal, as Borda
scoring does, is a simplification. If cardinal utilities could be elicited that reflect the
players’ intensities of preference, then these utilities, instead of the rank scores,
could be used to equalize, insofar as possible, players’ satisfaction with a division of
the items. For fair-division bidding schemes that incorporate cardinal information,
see Brams and Taylor, Fair Division and The Win-Win Solution; Brams and D. Marc
Kilgour, “Competitive Fair Division,” Journal of Political Economy, CIX, 2 (April 2001):
418-43; and Claus-Jochen Haake, Matthias G. Raith, and Francis E. Su, “Bidding for
Envy-Freeness: A Procedural Approach to n-Player Fair-Division Problems,” Social
Choice and Welfare (forthcoming).
Because the players rank the items exactly the same, all divisions are
efficient, making the choice of a fairest one appear difficult.

Only six divisions, however, are what Brams and Fishburn¹⁷ call
undominated splits:


(1, 2345), (12, 345), (13, 245), (14, 235), (15, 234), (145, 23) 


These divisions are those in which, in the absence of information
about preferences over subsets, either of the two allocations in each
might be preferred by a player, making each undominated. All these
divisions, therefore, are envy-possible.

The Borda maximin divisions are (13, 245) and (14, 235), which
give Borda scores of, respectively, [8, 7] and [7, 8] to the players. But
neither division might be envy-free if, say, both players prefer
allocation 13 to 245 in the first and allocation 14 to 235 in the second,
that is, both prefer the “same side” of each division. These preferences
imply that both prefer allocation 12 to 345 in the second division,
(12, 345), and allocation 145 to 23 in the sixth division, (145, 23),
precluding these divisions, as well, from being envy-free.

Thus, the preferences of A and B assumed above would eliminate
four of the undominated splits from being envy-free, allowing the two
remaining divisions to be so. For example, A might prefer allocation
1 in the first division and allocation 15 in the fifth, whereas B might
prefer the complements: allocation 2345 in the first, and allocation
234 in the fifth.

In none of our previous examples with envy-free divisions was such
a division not Borda maximin. But as we have just illustrated, there
may be several envy-free divisions, none of which is Borda maximin.
This divergence points to the limitation of Borda maximin as a
criterion for choosing divisions, because Borda scoring may not
reflect the intensity of player preferences that can be better gleaned
from player preferences over subsets.

But we do not need to know player preferences over subsets to
show that an envy-free division may not be Borda maximin.


Example VIII 

A: 1 2 3 4 5 6 7 8 9
B: 5 8 1 2 6 7 3 4 9
C: 3 4 9 1 2 5 6 7 8


17 “Fair Division of Indivisible Items between Two People with Identical
Preferences: Envy-freeness, Pareto-optimality, and Equity,” Social Choice and
Welfare, XVII, 2 (February 2000): 247-67.
It is easy to see that division (127, 568, 349) is envy-free, but it gives
a seventh-best item to A. By contrast, division (126, 578, 349) gives
sixth-best items to A and B and the same 349 allocation to C.

Because no other divisions (equal or unequal) give players
lowest-ranked items that are as high as sixth-best, division (126, 578, 349) is
maximin. But it is not envy-free: B may envy A, because she may prefer
allocation 126 to 578, making this division envy-possible.

The Borda scores of the envy-free division are [20, 22, 24], whereas
those of the maximin division are [21, 21, 24], so the maximin
division is also Borda maximin. Both the envy-free and maximin/
Borda maximin divisions have total Borda scores of 66, which is also
the Borda maxsum in example VIII.
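
The scores and the envy comparison for these two divisions can be sketched in Python. The envy test below is an assumption made for illustration, not a definition quoted from this section: for bundles of equal size, a player is taken to prefer her own bundle for certain when, after sorting both bundles by her ranking, each of her items outranks the item matched with it. The function names are hypothetical, and Borda scoring again assumes n - k + 1 points for the k-th of n items.

```python
# Rankings from example VIII, best to worst.
RANKINGS = ["123456789", "581267349", "349125678"]  # players A, B, C


def borda_scores(division):
    """Borda scores, assuming the k-th of n ranked items earns n - k + 1."""
    return [
        sum(len(rank) - rank.index(item) for item in bundle)
        for rank, bundle in zip(RANKINGS, division)
    ]


def surely_prefers(rank, own, other):
    """True if each of the player's own items, in rank order, is ranked above
    the matching item of the other bundle. Assumes equal-sized bundles."""
    mine = sorted(rank.index(i) for i in own)
    theirs = sorted(rank.index(i) for i in other)
    return all(m < t for m, t in zip(mine, theirs))


def is_envy_free(division):
    """True if every player surely prefers her bundle to every other bundle."""
    players = range(len(division))
    return all(
        surely_prefers(RANKINGS[p], division[p], division[q])
        for p in players
        for q in players
        if p != q
    )


for division in [("127", "568", "349"), ("126", "578", "349")]:
    print(division, borda_scores(division), "envy-free:", is_envy_free(division))
```

In the second division the test fails only for B against A: B ranks her own item 7 sixth but ranks A's item 6 fifth, so B could prefer 126 to 578.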

This example illustrates what we think is our most striking paradox.
Specifically, without any special assumptions about the preferences
of the players for subsets of items, it shows the clash between
envy-freeness and both maximin and Borda maximin. Furthermore,
because there are no unequal divisions in example VIII that satisfy any
of our fairness criteria (or, for that matter, the Borda maxsum
criterion), it highlights the difficulty of choosing a fairest allocation,
even in the equal-division case. Should one help the worst-off, or
avoid envy, when one cannot do both?¹⁸


18 This question is more fully explored in Brams and Daniel King, “Efficient Fair
Division: Help the Worst Off or Avoid Envy?” (unpublished), in which it is shown
not only that maximin and Borda maximin divisions may not be envy-free but also
that all such divisions may actually ensure envy. To illustrate this conflict, consider
the following example:


A: 1 2 3 4 5 6
B: 1 2 3 4 5 6
C: 1 5 4 6 2 3


There are four Borda maximin divisions, (14, 23, 56), (23, 14, 56), (13, 24, 56),
and (24, 13, 56), each giving a minimum Borda score of 8 to a player, which are also
maximin divisions (a worst-off player receives a fourth-best item). In addition, there
are two maximin divisions which are not Borda maximin divisions, (12, 34, 56) and
(34, 12, 56), which also give a worst-off player a fourth-best item. All six divisions
ensure envy: in each, one player prefers another player's two items to her own. This
example demonstrates that maximin and Borda maximin divisions, rather than just
preclude envy-freeness, may guarantee envy (that is, be envy-ensuring rather than
just envy-possible, as in example VIII). Furthermore, unlike example II, in which
the unique efficient maximin and Borda maximin division, (1, 3, 2), is also
envy-ensuring, the present example involves each player's receiving two items, which one
might think would be sufficient to allow a maximin or Borda maximin division to
be envy-possible, if not envy-free. This, however, is not the case, underscoring the
seriousness of the conflict among our fairness criteria.
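
The Borda maximin claim in this footnote's example can be checked by enumerating all 729 assignments of six items to three players. This is a rough sketch under the assumed convention that the k-th of n ranked items earns n - k + 1 points; `worst_rank` and the other names are illustrative helpers.

```python
from itertools import product

# Rankings from the footnote's example, best to worst.
RANKINGS = ["123456", "123456", "154623"]  # players A, B, C
ITEMS = "123456"


def borda_scores(division):
    """Borda scores, assuming the k-th of n ranked items earns n - k + 1."""
    return [
        sum(len(rank) - rank.index(item) for item in bundle)
        for rank, bundle in zip(RANKINGS, division)
    ]


def worst_rank(division):
    """Lowest rank position (1 = best) of any item held by any player."""
    return max(
        rank.index(item) + 1
        for rank, bundle in zip(RANKINGS, division)
        for item in bundle
    )


def all_divisions():
    """Yield every assignment of each item to player 0, 1, or 2."""
    for owners in product(range(3), repeat=len(ITEMS)):
        yield tuple(
            "".join(i for i, o in zip(ITEMS, owners) if o == p) for p in range(3)
        )


divisions = list(all_divisions())
best_min = max(min(borda_scores(d)) for d in divisions)
borda_maximin = sorted(d for d in divisions if min(borda_scores(d)) == best_min)
print("best minimum Borda score:", best_min)
print("Borda maximin divisions :", borda_maximin)

for d in [("12", "34", "56"), ("34", "12", "56")]:
    print(d, borda_scores(d), "worst rank:", worst_rank(d))
```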
IV. CONCLUSIONS

The eight paradoxes pinpoint difficulties in dividing up indivisible
items so that each player feels satisfied, in some sense, with her
allocation. The first two paradoxes show that efficient and envy-free
divisions may be incompatible because the only envy-free division
may be inefficient, or there may be no envy-free division at all.

Both of these paradoxes require at least three players.¹⁹ When
there are only two players, even when they rank items exactly the
same, it turns out that efficient and envy-free divisions can always be
found, except when the players have the same preferences over all
subsets of items.²⁰

But even when a unique efficient and envy-free division exists, it
may not be chosen by the players for other reasons. In particular,
such a division will not necessarily be selected when players vote for
the division or divisions that they prefer. Also, a unique efficient and
envy-free division will not necessarily be the division that maximizes
the minimum rank of items that players receive, so the Rawlsian
maximin criterion of making the worst-off player as well off as
possible may not single it out.

As a way of measuring the value of allocations to find those
divisions which are most egalitarian, especially in comparing equal and
unequal divisions, we used Borda scoring based on player rankings of
the items. We showed that a Borda maximin division may not be a
Borda maxsum division, indicating the possible conflict between
egalitarian and utilitarian outcomes.

This difference may show up when there are as few as two players
dividing up three items, making it impossible to divide the items
equally between the players. But even when this is possible, unequal
divisions of items may be the only ones that satisfy the Borda maximin
criterion. While indicating a preference for this criterion over the
Borda maxsum criterion when the two clash, we illustrated how
Borda maximin divisions may fail badly in finding envy-free divisions.
Indeed, there may be no overlap between Borda maximin and
envy-free divisions.

Our purpose is not just to indicate the pitfalls of fair division by
exhibiting paradoxes that can occur. There are also opportunities,
but these depend on the judicious application of selection criteria
when not all criteria can be satisfied simultaneously.


19 Edelman and Fishburn, “Fair Division of Indivisible Items among People with
Similar Preferences,” Mathematical Social Sciences, XLI, 3 (May 2001): 327-47.

20 Brams and Fishburn, “Fair Division of Indivisible Items between Two People
with Identical Preferences: Envy-freeness, Pareto-Optimality, and Equity.”
Several recent papers have suggested constructive procedures for
finding the most plausible candidates for fair division of a set of
indivisible items.²¹ We find this direction promising, because it is
potentially applicable to ameliorating, if not solving, practical
problems of fair division, ranging from the splitting of the marital
property in a divorce to determining who gets what in an international
dispute. While some conflicts are ineradicable, as the paradoxes
demonstrate, the trade-offs that best resolve these conflicts are by no
means evident.

STEVEN J. BRAMS
New York University
PAUL H. EDELMAN
Vanderbilt University
PETER C. FISHBURN
AT&T Shannon Laboratory


21 Brams and Fishburn, “Fair Division of Indivisible Items between Two People
with Identical Preferences: Envy-freeness, Pareto-Optimality, and Equity”; Edelman
and Fishburn; Brams, Edelman, and Fishburn; Brams and King; and Dorothea
Herreiner and Clemens Puppe, “A Simple Procedure for Finding Equitable
Allocations of Indivisible Goods,” Social Choice and Welfare (forthcoming).
Mind in a Physical World: An Essay on the Mind-Body Problem and Mental
Causation. JAEGWON KIM. Cambridge: Massachusetts Institute of
Technology Press, 1998. vii + 146 p. Paper $15.00.*


During the previous century, philosophers of mind attempted to
fashion a view of the relationship between the mental and the
physical which is at once both physicalistic and nonreductive. At one time
(and for some time), Jaegwon Kim thought that the concept of
supervenience could be harnessed to spell out the relationship
between the mental and the physical to this end. The idea was that,
while the supervenience relation is tight enough to capture what is
right about physicalism, it is also loose enough to be compatible with
the irreducibility of the mental to the physical. The main message of
Mind in a Physical World is that nonreductive physicalism (NRP) does not
work. Kim argues that his and other versions of NRP inevitably run
aground on the problem of mental causation. If he is right about this,
then philosophers true to their physicalism will have to swallow
reductionism. Those who find reductionism impossible to swallow
will have to find a way of living without physicalism.

Kim sees the problem of mental causation as arising from the
incompatibility of the following four plausible propositions:


(1) Many (or all) mental events cause physical events (alternatively,
many mental properties are causally relevant to some physical
properties).

(2) Mental events are distinct from physical events (the constitutive
properties of mental events are not physical properties).

(3) The physical is nomologically and causally complete and closed.

(4) Physical events are not pervasively causally and/or nomologically
overdetermined.


Each of these propositions taken singly is plausible, but together they
are inconsistent. At least one of the four has to go. The question is:
which?

Kim’s discussion of NRP involves a number of ontological
assumptions. One is a “sparse” view of properties on which properties are


* These are lectures that Kim originally gave as the Townsend Lectures at the
University of California/Berkeley, in March 1996. This review is the better for
having discussed these issues with Ned Block, Carsten Hansen, and Fritz Warfield
and for having read drafts of their reviews.
“mind-independent” features that “cut reality at its joints.” On this
view, not every predicate expresses a genuine property and it may be
that not every property is expressed by a predicate of our language.
That properties cut nature at its joints means that they are involved in
laws and causal relations in an appropriate way. It follows, for
example, that there is no guarantee that a disjunction of predicates that
express properties will also express a property.¹ If it turns out that a
putative property is not involved in a law and/or causal relations in
the right way, then it is not a genuine property. According to Kim, an
event is an instantiation of a property by an individual (or a relation
by the appropriate number of individuals) at a time. A mental event
is the instantiation of a mental property. For physicalism to have
content, something must be said about the issue of characterizing the
physical. Kim's view seems to be that the microphysical properties of
ideal physics are physical.² He also counts as physical properties that
are conjunctions and aggregates of microphysical properties and
higher-level properties defined over lower-level physical properties
(115).³ If reductionism is true, then those mental predicates which
express genuine properties express physical properties.

The claim that physics is causally and nomologically closed and
complete means that each physical event has a sufficient (or as
sufficient as possible) causal/nomological explanation in terms of
prior physical events. The way I understand it, the causal/nomological
completeness of physics is a contingent claim whose exact
content depends on how ‘physical event’ is understood. I think most
physicists would agree that there is strong evidence in its favor when
‘physical event’ is understood as “event describable in the language of
ideal micro-physics.”⁴

NRP is a family of views differing in their ontological frameworks
(views about properties, events, laws, and so on) and how they un- 


1 In some places, Kim suggests not only that genuine properties are always
involved in causal relations, but the stronger view that (some of) their causal
relations are essential to them.

2 I am not sure how one should characterize “ideal physics” except that it is safe
to assume that the language of ideal physics contains no intentional or phenomenal
predicates. I have a bit more to say about this in “From Physics to Physicalism,” in
Carl Gillett and myself, eds., Physicalism and Its Discontents (New York: Cambridge,
2001), pp. 37-56.

3 Kim thinks that his characterization results in chemical and biological
properties counting as physical, but it does not. Being a water molecule is not an aggregate
or conjunction of fundamental microphysical properties but a vast disjunction,
since water molecules can occupy infinitely many quantum states.

4 This is Steven Weinberg's view in Dreams of a Final Theory (New York: Random
House, 1992).
derstand ‘reduction’ and ‘physicalism’. Following Kim, I understand
the irreducibility of the mental to the physical as meaning that there
exist mental events and properties that are distinct from any physical
events and properties.⁵ Kim observes that a necessary condition for
physicalism is that mental properties, events, and laws supervene on
physical ones. He considers various understandings of
‘supervenience’, but I think that any kind of physicalism worthy of the name
requires a supervenience condition I shall call Jackson physicalism (JP),
formulated as


JP: Any minimal physical duplicate of the actual world is a duplicate
simpliciter.⁶


Some complications aside, JP entails that true mental propositions
(for example, that Jaegwon is thinking about sailing) are metaphysically
entailed by true physical propositions (including propositions
expressing physical laws). Kim thinks that supervenience relations are
too weak to capture the idea that mental-property instantiations
depend on physical-property instantiations, so he adds that the
mental depends on the physical.⁷ This requirement threatens to make the
supervenience condition superfluous and leaves dependence to be
explicated. Perhaps dependence can be captured by adding that the
physical facts are the only natural class of facts which satisfies the JP
scheme.

The conflict among (1-4) is obvious. If P occurs at t, the physical
state S (which may consist of a fusion of many microphysical events)
at tk fully accounts for P. If M supervenes on S and is a cause of P,
then it appears that P is overdetermined by S and M. If there is no
such overdetermination, then either M does not cause P or is
identical to some part of S. If the latter, then it is a physical event and so

5 Irreducibility is often taken also to have an epistemic component. For example,
the connections between physical and mental facts (properties, events) are
enormously complex, nonexplanatory, epistemically inaccessible, and so on.

6 Frank Jackson, From Metaphysics to Ethics (New York: Oxford, 1998), gives this
necessary condition on physicalism. It captures the idea that all God has to do to
create our world is to create its physical part; metaphysical connections take care of
the rest. That it is a necessary condition follows from the fact that, if it failed, then
God would have more work to do after creating the physical facts and laws.

7 Two of the reasons that Kim gives for holding that supervenience is too weak to
capture physicalism have to do with features of his formulation of supervenience.
One is that he formulates the doctrine as involving properties. That leaves the
possibility that there are laws and other facts that do not supervene on the physical.
Another is that he allows that the notion of necessity involved in supervenience is
nomological necessity. That allows for the possibility of fundamental bridge laws
linking physical and mental properties, and the existence of such laws would surely
violate physicalism.
(2) fails. René Descartes’s resolution of the problem of mental
causation rejects the completeness of physics. But this is not an
option for physicalists since it involves there being nomological or
causal relations that do not metaphysically supervene on the physical.
Epiphenomenalists reject (1). Given Kim’s view about properties,
that is tantamount to the view that mental predicates fail to express
genuine properties, that is, to eliminativism. Reductive physicalists
reject (2). But there are persuasive and widely accepted arguments
that mental predicates do not express physical properties. Given
Kim’s view of events, this means that mental events are not physical
events. It looks like the only way out is to reject (4), but, as we shall
see, Kim thinks there are reasons against that as well. It is not for
nothing that Arthur Schopenhauer calls the mind-body problem “the
world knot.” Kim’s claim is that NRP is the latest failed attempt to
untie it.

In a nutshell, Kim's argument against NRP is this: NRP is
committed to (1-3). (1-3) imply the denial of (4). But (4) is more plausible
than the conjunction of (1-3). So one of (1-3) is false.

Kim thinks that escape from the problem of mental causation by
rejecting (4) is not a reasonable option. In Mind in a Physical World,
he gives three reasons for this (44, 45): (A) it is implausible that every
physical event that has a mental cause is causally overdetermined; (B)
given the causal completeness and closure of physics, mental causes
are dispensable; (C) overdetermination may come into conflict with
physical causal closure. In addition to these arguments, Kim seems to
suggest that (4) is a conceptual truth. But so far as I can see, he
provides no argument for this claim other than the preceding reasons,
and they, even if successful, fall short of establishing that.

A typical case of causal overdetermination is the example of two
assassins each independently shooting at the victim. In this kind of
example, the causes are wholly distinct events independent of each
other. Call this independent overdetermination. The kind of case relevant
to NRP is, in contrast, one in which the overdetermining causes are
metaphysically connected to each other. An example of this kind is a
neurophysiological event N realizing a mental event M (say, a desire
for a sip of beer) and where both cause E (say, the subject's hand's
moving). Call this kind dependent overdetermination. Note that, in both
cases, the causes by themselves do not determine the effect but, at
most, determine the effect given the laws and further
circumstances. Independent overdetermination is possible but rare. If our
evidence is that the prime minister has been shot, it will usually be
more reasonable to posit a single assassin rather than two assassins
acting independently. The reason that we think the first kind of
overdetermination is generally not plausible is that, when we posit
two independent causes C1 and C2 for E, we raise the question of why
both C1 and C2 and why the events that occurred led to the same
effect. But in the case of causes that are metaphysically connected, we
do not face these questions.⁸ So Kim's remark that pervasive
overdetermination of certain physical events by mental and physical events
is implausible carries no force against NRP.

Kim's suggestion that mental causes are “dispensable” fares no
better. The only way M could be dispensed with is by dispensing with
P, and P may not be dispensable with respect to E, that is, if P had
failed to occur, E would have failed to occur. What Kim may have in
mind is not that M is dispensable but that its being a cause of E is
dispensable. Kim may have something like the following argument in
mind. There are two possible worlds w and w* in which (2) and (3)
hold and which are exactly alike physically and mentally but differ
only in that in w some mental events are overdetermining causes of
certain physical events and in w* mental events are epiphenomenal.
Our evidence concerning causal relations would be exactly the same
in w and w*. Since w contains facts over and above w* which are
explanatorily idle with respect to the evidence, we should believe that
the actual world is w* rather than w. This argument seems to be
appealing to the principle ‘Don’t multiply causes without necessity’.
Whatever the merits of this principle in general, it is not clear that it
applies to this case since the two causes are necessarily connected.

Kim’s final argument is this:


For consider a world in which the physical cause does not occur and
which in other respects is as much like our world as possible. The
overdetermination approach says that in such a world, the mental cause
causes a physical event—namely, that the principle of causal closure of 
the physical no longer holds. I do not think we can accept this
consequence: that a minimal counterfactual supposition like that can lead to
a major change in the world (45).


This is puzzling. Consider two worlds: in W1, M occurs and N does
not, but M is realized by another brain state N* and both M and N*
are causes of E; in W2, M occurs and causes E but E has no physical
cause. Kim seems to be saying that W2 is more similar to the actual


8 In previous writings, Kim formulated an “exclusion principle” according to
which no event has two independent causal explanations and which he uses to argue
against overdetermination. The exclusion principle figures a little less prominently
in these lectures, perhaps because it does not seem applicable to mental-physical
causation in NRP since, on it, mental and physical events are not metaphysically
independent.
world than W1. But why think this? In W1, supervenience and the
completeness of physics both hold, while in W2 they both fail. In fact,
it seems obvious that we accept ‘If M had occurred but had not been
realized by N, it would have been realized by another neural state’
and reject the corresponding counterfactual with the consequent
‘supervenience and the completeness of physics would have failed’.

It becomes clear why Kim thinks that dependent
overdetermination is ruled out when he describes what he thinks is the right way to
describe the situation if NRP holds:


However, if we understand the difference between genuine, productive
and generative causal processes, on the one hand, and noncausal
regularities that are observed because they are parasitic on real (sic) causal
processes, we are in a position to understand the picture recommended
by (viii) (i.e., by NRP). In the case of the supposed M-M* causation the
situation is rather like a series of shadows cast by a moving car. The
moving car represents a genuine causal process, but the series of
shadows it casts, however regular and law-like it may be, does not constitute a
causal process (45).


Kim is thinking of causation as a relation in which the cause generates
or produces the effect. I am not sure what these metaphors come to,
but they suggest that in some way, E grows out of C. In any case, if we
think of causation in this way then each of the reasons that Kim gives
against overdetermination appears more convincing. (1) If both N
and M each produce E, then, even if they are metaphysically
connected, they can do so independently of each other. (2) If we have
reason to believe that N produces E, then we have no need also to
posit M as productive of E. That hypothesis is dispensable. (3) If M
produces E, then, if we just take N away, M would still produce E,
thereby violating the causal completeness of the physical. If we reject
overdetermination, then we shall have to say that M does not produce
M* (or anything else) but is like the shadow of the car. So if we think
of causation in terms of one event generating or producing another,
then it looks like we may have good reason to hold onto (4). Finally,
the metaphor of C producing E or E growing out of C suggests that it
is impossible for E to grow out of any distinct E. In the case of
independent overdetermining causes, this would mean that the effect
grows out of both, each contributing some aspects of the effect (for
example, two bullet wounds). But in the case of NRP, it looks as
though M contributes nothing more than N already contributes.⁹


9 As I mentioned, it is not clear whether Kim thinks that the nonexistence of
overdetermining causes is a plausible claim based on an evidentiary principle ‘Don’t
Under the heading “free lunches,” Kim discusses counterfactual
accounts of causation as a way for NRP to satisfy (1-3) and motivate
rejecting (4). The basic idea of the counterfactual account is that, on
a certain understanding of counterfactuals, C causes E if C and E are
distinct events (or propositions stating their occurrence) and E
counterfactually depends on C, that is, ~C > ~E.¹⁰ On David Lewis's
well-known account of counterfactuals, the completeness of physics is
compatible with the overdetermination of counterfactual dependence.
So why is this not a way in which NRP can both have and eat its cake?
Kim gives two reasons. One is that the counterfactual account of
causation is inadequate as an account of causation. The other is that Lewis's
account of counterfactuals, which may solve some of the problems he raises
for the counterfactual account, involves “heavy metaphysical
assumptions” (130, footnote 19). Since Kim does not say what these
assumptions are, I shall address only the first kind of worry.¹¹

Kim's main objection is that the counterfactual account does not
distinguish epiphenomenal relations from genuine causation. Let S1
be the location of the car's shadow at t1 and S2 at t2, and C1 and C2 the
corresponding locations of the car. The counterfactuals ¬S1 □→ ¬S2 and
¬S1 □→ ¬C2 may strike us as true, though there is no causal relation between
S1 and either S2 or C2. Further, it seems to Kim that the supervenience
of M on N is compatible with epiphenomenalism with respect to E
(indeed Kim thinks this is the case if M is distinct from N) and yet in this
situation ¬M □→ ¬E. So the counterfactual account cannot distinguish
epiphenomenalism from mental causation. Kim also observes that
¬E □→ ¬E* may be true when E does not cause E*, but both are effects of
a common cause, and that ¬E □→ ¬C may be true when C causes E. If
these objections are correct, then the counterfactual account fails.

I think these objections can be overcome by combining an improved
version of Lewis's account of counterfactuals with a slightly





multiply causes beyond necessity,’ or a conceptual/necessary truth. It looks like
Kim's argument against NRP requires the latter, since it is hard to see why this
instance of the evidentiary principle is more plausible than the conjunction of (1-3).

10 David Lewis—Philosophical Papers, Volume II (New York: Oxford, 1986)—proposes
a condition that is necessary as well as sufficient: C causes E if and only if there
is a chain of counterfactual dependencies from C to E. There are well-known
problems with the necessity of the account due to pre-emptive causes. Lewis has
recently modified his account greatly to deal with them; see “Causation as
Influence,” this JOURNAL, XCVII, 4 (April 2000): 182-97. But whether or not this is
successful, if the condition is sufficient, and it could be shown that it permits mental
causation that is compatible with NRP, Kim’s objections would be answered.

11 Kim may have in mind Lewis's view that possible worlds are concrete totalities,
but issues about the nature of possible worlds are orthogonal to issues concerning
the adequacy of counterfactual accounts of causation.
more complicated account of causation. On Lewis's account, A □→ B is
true if and only if there are worlds at which A and B are true and no
world at which A&¬B is true is more similar to the actual world than
is some world at which A&B is true. Two factors determine the
ranking of worlds as more or less similar to the actual world: (i) the
size of the regions of match in occurrent fact, and (ii) the size of the
regions of conformity to laws of the actual world. According to the
account, ‘If I had struck the match at t it would have lit at t′’ is true
since the most similar world(s) to the actual world is one that is
exactly like the actual world until a time shortly before t when a small
violation of the actual laws leads to my striking the match. The laws
then evolve that world so as to entail that the match lights at t′.
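
The truth condition just stated can be written out symbolically. What follows is only a sketch of one reading of the sentence above, not Lewis's own notation: the macro \cf, the symbol @ for the actual world, and the relation v <_@ w (read "v is more similar to the actual world than w is") are all labels introduced here for illustration.

```latex
% Assumes amssymb for \Box; \cf is a hypothetical macro for the counterfactual conditional.
\newcommand{\cf}{\mathrel{\Box\!\!\rightarrow}}
\[
  A \cf B \ \text{is true at } @
  \iff
  \exists w \,\bigl[\, w \models A \wedge B
    \;\wedge\; \neg\exists v \,\bigl( v \models A \wedge \neg B \;\wedge\; v <_{@} w \bigr) \bigr]
\]
% The ordering <_@ is fixed by the two factors named in the text:
% (i) the size of the regions of match in occurrent fact, and
% (ii) the size of the regions of conformity to the laws of the actual world.
```

On this rendering the match example comes out true because the most similar world in which the match is struck at t is also one in which it lights at t′.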

It is important to keep in mind that in evaluating Lewis's
counterfactual account of causation, one must employ counterfactuals
characterized by Lewis's proposed similarity relation and not merely
appeal to intuitions. It may be that we are inclined to affirm the
counterfactual ‘If her mother had not had blue eyes, then neither
would she,’ even though we know that it is not the mother's blue eyes
but rather a gene factor that is a cause of the daughter’s blue eyes.
But this is irrelevant as an objection to the counterfactual account of
causation, since Lewis is characterizing a special class of
counterfactuals and, on his characterization, this counterfactual (plausibly)
comes out false.¹² Lewis argues that his counterfactuals (and so the
account of causation in terms of these counterfactuals) can handle
the problem of the direction of causation and the problem of effects
of a common cause.¹³ In order to handle the problem concerning
epiphenomenalism for higher-level events, we need to complicate
Lewis's account. Say that M causes E if ¬M □→ ¬E and there is no
lower-level event which preempts this relation which is not itself
preempted by M.¹⁴ N preempts M with respect to E if ¬N □→ ¬E and


12 That is, Lewis thinks that the most similar world in which the mother does not
have blue eyes is one in which she still has the gene factor for blue eyes but
something goes wrong in the phenotypic expression of that factor in her and that
factor is passed on to the daughter who then has blue eyes.

13 In fact, as Adam Elga and Douglas Husak have recently argued in dissertations
(at Massachusetts Institute of Technology and Rutgers University, respectively),
Lewis's account of counterfactuals requires revision having to do with statistical
mechanics. But this is not relevant to my defense of the role of counterfactuals in
mental causation since there are repairs that can be made in the Lewis account to
avoid the problems.

14 We also need to include that the physical states that are the supervenience
bases for M and E are connected by fundamental dynamical law. A proposal along
these lines is found in Ernest Lepore and myself, “Mind Matters,” this JOURNAL,
LXXXIV, 11 (November 1987): 630-42, and in Lepore and my “More on Making
Mind Matter More,” Philosophical Topics (1991): 157-91.
N&¬M □→ E. The intuitive idea is that N preempts M if E counterfactually
depends on N but does not counterfactually depend on M as long as
N occurs. This condition clearly makes the shadow epiphenomenal
with regard to the car’s position and the future shadow. Both the
mental epiphenomenalist and the interactionist dualist hold that
mental events are (at most) connected by bridge laws to
neurophysiological states.¹⁵ It follows that N&¬M is metaphysically possible.
Suppose that ¬M □→ ¬E, so M is a putative cause of E. The
epiphenomenalist holds that it is not a genuine cause since N preempts M,
while the interactionist dualist holds that M is not preempted and is
a genuine cause. The nonreductive physicalist holds that there are
mental events M that are putative causes that are not preempted by
events that they themselves do not preempt. The existence of such
mental events is compatible with nonreductive physicalism.
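
The two conditions just given (M causes E; N preempts M with respect to E) can be tried out on a toy model of the car and its shadow. The sketch below is illustrative only. The four worlds and their distances from the actual world are stipulated by hand rather than derived from Lewis's similarity factors, the names WORLDS, counterfactual, preempts and causes are hypothetical, ties in similarity are resolved in favor of truth as a simplifying assumption, and the further requirement that the supervenience bases be connected by fundamental dynamical law is left out.

```python
from typing import Callable, Dict, List, Tuple

World = Dict[str, bool]
Prop = Callable[[World], bool]

def atom(name: str) -> Prop:
    return lambda w: w[name]

def neg(p: Prop) -> Prop:
    return lambda w: not p(w)

def conj(p: Prop, q: Prop) -> Prop:
    return lambda w: p(w) and q(w)

def counterfactual(worlds: List[Tuple[World, int]], a: Prop, b: Prop) -> bool:
    """A []-> B: some A&B world exists and no A&not-B world is strictly closer."""
    ab = [d for w, d in worlds if a(w) and b(w)]
    a_not_b = [d for w, d in worlds if a(w) and not b(w)]
    if not ab:
        return False
    return not a_not_b or min(a_not_b) >= min(ab)

def preempts(worlds, n: Prop, m: Prop, e: Prop) -> bool:
    """N preempts M w.r.t. E: not-N []-> not-E, and N & not-M []-> E."""
    return (counterfactual(worlds, neg(n), neg(e))
            and counterfactual(worlds, conj(n, neg(m)), e))

def causes(worlds, m: Prop, e: Prop, lower_level: List[Prop]) -> bool:
    """M causes E: not-M []-> not-E, and no lower-level event preempts M
    without itself being preempted by M."""
    if not counterfactual(worlds, neg(m), neg(e)):
        return False
    for n in lower_level:
        if preempts(worlds, n, m, e) and not preempts(worlds, m, n, e):
            return False
    return True

# C: car at its earlier location; S: shadow at its earlier location;
# E: shadow at its later location. Distances are stipulated (0 = actual world).
WORLDS: List[Tuple[World, int]] = [
    ({"C": True,  "S": True,  "E": True},  0),  # actual world
    ({"C": False, "S": False, "E": False}, 1),  # no car, hence no shadows
    ({"C": True,  "S": False, "E": True},  2),  # earlier shadow blocked, car drives on
    ({"C": False, "S": True,  "E": False}, 3),  # stray shadow without a car
]
C, S, E = atom("C"), atom("S"), atom("E")

print(counterfactual(WORLDS, neg(S), neg(E)))  # the shadow is a putative cause
print(preempts(WORLDS, C, S, E))               # the car preempts the shadow
print(causes(WORLDS, S, E, [C]))               # so the shadow is not a genuine cause
```

In this model the later shadow does depend counterfactually on the earlier one, yet the car's position preempts the earlier shadow and is not preempted by it, so the earlier shadow fails the test for genuine causation. Different stipulated distances would give different verdicts, which is why the choice of similarity relation carries so much weight in the argument.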

I do not expect Kim to be persuaded by my counterfactual defense
of causation. He is likely to consider it “causation lite” when
compared to causation as production or generation. But the
generation/production conception of causation fits ill with contemporary
physics. Bertrand Russell¹⁶ famously said that causation so
understood “is a relic of a bygone age, surviving, like the monarchy, only
because it is erroneously supposed to do no harm” (ibid., p. 1).
What Russell had in mind is that the fundamental laws and facts of
physics do not mention causality at all. If one is a physicalist, then one
holds that whatever causal relations there are supervene on the
fundamental laws and facts of physics. But the fundamental laws (for
example, Schrödinger’s law) relate the totality of the physical state at
one instant to the totality at later instants.¹⁷ The laws do not single
out parts of states at different times as being causally related. If S′ is
the microphysical state in a region R at time t and t′ is a time prior
to t, then nothing less than the state S of the region R* that fills the
backward light cone of R can be said to produce S′. We cannot say
that one event (one part of the physical state) produces another part
since the laws do not connect parts in this way. If we want an account
of causation which selects among the parts of the state at t the parts
that are responsible for a subsequent event, we have to look
elsewhere. One approach is to ask which events, given the laws, make a


15 Such bridge laws do not supervene on the physical laws and states.

16 “On the Notion of Cause,” Proceedings of the Aristotelian Society, XII (1912).

17 Or, a little more correctly, a region of the state at time t is related by law to the
states in its backward light cone. This point, which is often neglected by philosophers, is
forcefully made in Noa Latham, “Singular Causal Statements and Strict
Deterministic Laws,” Pacific Philosophical Quarterly, LXVIII (1987): 29-43, and Hartry Field,
“Causation in a Physical World” (manuscript).
difference to whether or not the effect occurs or how much of a
difference it makes to what occurs. This is the idea that
counterfactual accounts attempt to capture. I am not confident that Lewis’s
particular counterfactual account (with my additions) quite captures
this idea. But I do think that there must be some account on which
causal relations, to the extent they are real, are determined by the
fundamental laws and the noncausal fundamental facts. On the other
hand, the production conception of causation seems to require
causal relations that fail to supervene on the fundamental laws (as
they are conceived in contemporary physics) and facts. That is, it
seems to allow the existence of two universes that match perfectly in
fundamental laws (of the sort posited by contemporary physics) and
facts and yet differ in which events “produce” which others. This
indeed is a heavy metaphysical assumption.

The problem of mental causation seems insoluble to Kim because
he thinks of causation in terms of metaphors of generation and
production. This itself is a good reason to follow Russell’s suggestion
and dispense with this conception of causation. I suggest that the
difference-making conception of causation suitably spelled out in
terms of counterfactuals may be all we can get and all we need from
causation.¹⁸ In this case, a free lunch may be preferable to
indigestible metaphysics.

BARRY LOEWER 

Rutgers University 


18 There are still other problems involving mental causation. The
nonsupervenience of intentional mental states on brain states raises worries about whether the
relevant counterfactuals that ground difference-making causation are forthcoming.
Also, a satisfactory account of mental causation needs to spell out the mechanisms
by which mental states are causally related to each other and to physical states. Kim
mentions these problems, but the book is almost entirely devoted to the problem
presented by the incompatibility of (1-4).
Semantics, Tense, and Time. PETER LUDLOW. Cambridge: Massachusetts
Institute of Technology Press, 1999. xxi + 252 p. Cloth $45.00.


This is not your typical book about the A-theory/B-theory controversy
in metaphysics. Peter Ludlow attempts something that few
philosophers have tried in the last thirty years: he actually argues from
linguistic premises for metaphysical conclusions. The relevant
linguistic premises have to do with the nature of language, a general
theory of semantics, the proper analysis of tense, and various
technical theses involving the treatment of temporal indexicals and
temporal anaphora (among other things). The metaphysical conclusions
that Ludlow argues for from these linguistic premises are some of the
main claims normally associated with the A-theory in the philosophy
of time, namely, (i) that tense is a genuine feature of the world
(which means, roughly, that there are monadic, temporal
properties—like pastness, presentness, futurity, being two days past, being
three days future, and so on (sometimes called “A-properties”)—and
that the instantiation of these properties does not somehow reduce to
the instantiation of two-place, temporal relations like earlier than,
simultaneous with, later than, and so on), (ii) that temporal
becoming (roughly, successively coming to possess different A-properties) is
intrinsic to all events, and (iii) that only the present is real.

The overall plan of the book is as follows. First, Ludlow spends four
chapters defending a cluster of related claims about language and
semantics in general and, in particular, the semantics for temporal
indexicals and temporal anaphora. (He says that none of this
material is original—he attributes most of it to Donald Davidson, Noam
Chomsky, Gareth Evans, and James Higginbotham—but it seems to
me that a fair portion of what goes into this part of the book
(including, especially, Ludlow’s way of combining various ideas) is
indeed both original and of genuine significance.) Then he devotes
a chapter to developing a B-theory semantics, and another chapter to
discussing what he takes to be certain insurmountable problems with
the B-theory semantics. Next, he devotes a chapter to developing an
A-theory semantics, and another chapter to discussing what he takes
to be the main problems facing those A-theory semantics, together
with his reasons for thinking that those problems can be overcome.
The penultimate chapter of the book is devoted to arguing that
considerations from “cognitive psychology generally” and “the theory
of concept/language acquisition specifically” favor the A-theory over
the B-theory. And the final chapter contains remarks by Ludlow on
what he takes to be the philosophical and linguistic consequences of
his investigations. There are also two “philosophical appendices” and
five “technical appendices.” (Taken together, the appendices
constitute about one-fourth of the entire book.)

Semantics, Tense, and Time is an impressive and ambitious book.
Ludlow touches on a huge number of different arguments, almost all
of which raise interesting and important issues. (In fact, some will
find the book to be almost too ambitious, and will wish that Ludlow
had focused on a smaller number of topics and arguments, in order
to be able to consider in greater detail those arguments and the most
likely rejoinders to them.) There is no way I could do justice to even
half a dozen of the most important arguments in the book, so in the
remainder of this review, I shall limit my discussion to three main
worries concerning Ludlow's treatment of some of his central topics.

My first worry has to do with the relation between the A-theory, as
formulated by Ludlow, and the claim—sometimes called presentism—
that only present objects exist. Ludlow formulates the A-theory in
such a way that it incorporates presentism. But in my mind, this is
both unnecessary and problematic.

Building presentism into the A-theory is unnecessary because,
although presentism entails the A-theory, the opposite is not true. That
is, the essential claim of the A-theory—that tense is a genuine feature
of the world—does not entail presentism. For one could believe that
tense is a genuine feature of the world—that various events, for
example, have the property of being past, while others have the
property of being present and still others have the property of being
future—without thereby being committed to the claim that only
present things exist. In other words, it is quite possible to be an
A-theorist but not a presentist. (In fact, I suspect that the majority of
A-theorists are not also presentists.)

So there is no reason why we are forced to build presentism into
the A-theory. Moreover, doing so is problematic because of the many
difficulties facing the presentist. For example, Ludlow is forced to
conclude that the A-theorist must avoid all reference to past and
future things, including past and future times. This turns out to be a
major constraint on the different possible A-theory semantics that
Ludlow considers, and it leads to all manner of difficulties for the
A-theorist. (For the record, I think that there are some important
ways around this problem which are available to the presentist but not
considered by Ludlow, including the construction of past and future
times out of present resources (such as maximal propositions).¹)




1 For more on this line, see Arthur Prior and Kit Fine, Worlds, Times and Selves
(London: Duckworth, 1977), and Edward Zalta, “On the Structural Similarities
between Worlds and Times,” Philosophical Studies, LI (1987): 213-39.
My second main worry has to do with Ludlow's treatment of what
he considers to be the biggest problem facing the B-theory, namely,
the problem of temporal indexicals. As Ludlow spells it out, the
problem begins with an alarming episode from his personal life. It
seems that Ludlow is sitting at his desk one day (March 12, to be
specific), unaware of the exact date, when he has the thought that his
fifth anniversary is coming up sometime soon, and so calmly utters a
token of this sentence:


(1) My fifth anniversary is (this) March 12.


A short time later, he happens to glance at his calendar, whereupon he
goes into a panic, and frantically utters a token of the following sentence:


(2) My fifth anniversary is today.


Suppose we call these two utterances “U1” and “U2,” respectively.
Then it should be clear that, intuitively, there is a huge difference
between the meanings of U1 and U2.

The A-theorist can explain our intuition that U1 and U2 have
different meanings by invoking the putative A-property of
presentness. That is, he can say that U2 asserts that Ludlow’s fifth anniversary
has the property of being present, while U1 merely asserts that
Ludlow’s fifth anniversary and March 12 are simultaneous with each
other. And this difference in what the two utterances assert can be
attributed to a difference in the meanings of U1 and U2.

The B-theorist, on the other hand, is limited to just B-theory
resources, so she cannot explain our intuition that U1 and U2 have
different meanings in this way. And, in fact, if we limit ourselves to
considering just B-theory concepts, we seem forced to say that the two
utterances say exactly the same thing.

Ludlow considers various possible ways for the B-theorist to
respond to this problem. Each of the ways he considers is a semantical
solution that involves trying to give “tenseless” truth conditions for
sentences like U2. And each of the proposed solutions either fails
miserably, or else leads to further difficulties, or both. Thus, Ludlow
concludes, consideration of the problem of temporal indexicals
shows that the B-theory should be rejected.

This appears at first glance to be a good argument against the
B-theory, but I think that the argument turns out to be a little bit too good.
The problem of temporal indexicals is indeed a serious problem for the
B-theorist.² But, fortunately for the B-theorist, it is not a problem that


2 See Prior, “Thank Goodness That's Over,” in his Papers in Logic and Ethics
(London: Duckworth, 1976), pp. 78-84, and Richard Gale, The Language of Time
(London: Routledge & Kegan Paul, 1968).
is peculiar to the B-theory. For as John Perry³ has shown, the problem
of temporal indexicals is a special case of a more general problem
involving temporal, spatial, and personal indexicals alike.

To see this, consider a variation on Ludlow's example. Suppose
that Ludlow is cycling in Vermont, and has promised to call his wife
from the town of Ludlow immediately upon arriving there. Late one
morning, he arrives at a picturesque town and stops for lunch. As he
sits in the town square munching on a sandwich, his thoughts turn to
his wife and home, and he calmly utters a token of this sentence:


(3) I promised to call my wife as soon as I arrived in Ludlow.


A short time later, a passerby happens to remark casually as he walks
past, “Welcome to Ludlow,” whereupon Ludlow (the man, not the
whole town) goes into a panic, and frantically utters a token of the
following sentence:


(4) I promised to call my wife as soon as I arrived here.


Now, it is clear that these last two utterances—call them U3 and U4,
respectively—do not at first glance appear to have the same meaning.
But it is not at all clear that this should count as evidence for the
claim that there are special, monadic, spatial properties like hereness.

Or consider this example. Suppose that Ludlow is suffering from
amnesia, and cannot remember who he is or what he has done. While
puzzling over those topics, he is reading with great fascination from
a book called Semantics, Tense, and Time. He glances at the title page
and calmly utters a token of this sentence:


(5) Ludlow is the author of Semantics, Tense, and Time.


A short time later, something in the book jogs his memory, and it all
comes back to him, whereupon he excitedly utters a token of the
following sentence:


(6) I am the author of Semantics, Tense, and Time.


I think that what Ludlow’s example and these other examples all show
is that there is something essential about indexicals, whether they be
temporal, spatial, or personal indexicals. But I do not think there is any
reason to conclude that any such example shows that there must be
ineliminable, monadic properties of the relevant kind. Moreover, I think
that, if Ludlow’s argument from temporal indexicals succeeds, then it
shows too much. For there will be an analogous argument from spatial
indexicals, whose conclusion is that there are ineliminable, monadic,


3 “The Problem of the Essential Indexical,” Noûs, XIII (1979): 3-21.
spatial properties, and there will also be an analogous argument from
personal indexicals, whose conclusion is that there are ineliminable,
monadic, personal properties. Since these are wildly implausible
conclusions about the metaphysics of places and the metaphysics of people,
we should resist the spatial and personal analogues of the argument
from temporal indexicals. And this suggests that, whether or not we have
independent reasons for accepting the A-theory (such as, for example,
the fact that it strikes many of us as obviously true), we should not
endorse the argument from temporal indexicals for that theory.

The third of my main worries is a general one that concerns
Ludlow's overall approach to metaphysics. Ludlow explicitly says that
he sees his entire project as an attempt to show that one can argue
from the structure of human language to the nature of reality. My
worry is this. Even if we grant that aspects of a natural language
(including its semantics) reflect the metaphysical commitments of
that language's speakers, we have no reason to think that those
commitments must be correct. After all, it is perfectly possible to have
a language built around some false metaphysical views.

To be fair, this is a worry that applies to a large number of
well-known arguments in philosophy, many of which are rightfully
taken with great seriousness. But I do think that, insofar as an
argument for a metaphysical conclusion has purely linguistic
premises (as opposed to a mixture of metaphysical and linguistic
premises), we should be highly suspicious of that argument. And I worry
that many of Ludlow’s main arguments may fall into this category.

One final note, on a related point. The title of Ludlow’s book is
well chosen. It accurately reflects the contents of the book, which is
more about semantics than it is about tense, and more about tense
than it is about time. There are some philosophical purists who will
be put off by this approach, and who will consider the book to be for
linguists rather than philosophers. But even if this is true, it is also
true that Semantics, Tense, and Time is a remarkably rich sourcebook
for both philosophers of time and linguists. The book contains
numerous fascinating cases—mainly in the form of sentences from
natural languages—that present all kinds of perplexing puzzles and
provocative problems for various theories in both the philosophy of
time and linguistics.

NED MARKOSIAN 

Western Washington University 
NOTES AND NEWS


The Editors regret the death of the eminent American philosopher
Willard Van Orman Quine on December 25, 2000, at the age of 92. A
native of Akron, Ohio, Professor Quine enjoyed a long and
distinguished career in philosophy, making contributions in logic, set theory,
philosophy of language, epistemology, and other fields. After receiving
his B.A. in 1930 from Oberlin College, he began his doctoral studies at
Harvard University, completing his Ph.D. under the guidance of Alfred
North Whitehead in 1932. Quine was first appointed a Junior Fellow for
the Society of Fellows at Harvard from 1933-1936, hired as an Instructor
in 1936, promoted to full professorship in 1948, and received the Edgar
Pierce Professorship in 1956. He remained on the faculty until his
retirement in 1978. His most influential essays include “Truth by
Convention” (1936), “New Foundations for Mathematical Logic” (1937),
“On What There Is” (1948), “Two Dogmas of Empiricism” (1951),
“Ontological Relativity” (1969), and “Epistemology Naturalized” (1969).
A short list of his distinguished published texts comprises A System of
Logistic (1934), From a Logical Point of View (1953), Word and Object (1960),
The Ways of Paradox and Other Essays (1966), Ontological Relativity and Other
Essays (1969), The Roots of Reference (1974), Theories and Things (1981),
Pursuit of Truth (1990), and From Stimulus to Science (1995). Quine wrote
The Time of My Life: An Autobiography in 1985.

The significance of Quine’s contribution to philosophy occasioned
numerous awards and honors. He became a Fellow of the American
Academy of Arts and Sciences in 1949, and served as President of both
the Association of Symbolic Logic, 1953-1955, and the American
Philosophical Association, 1957. Quine was a corresponding Fellow for the
British Academy beginning in 1959, and a corresponding member of the
Instituto Brasileiro de Filosophia beginning in 1963. He became a Fellow
of the National Academy of Arts and Sciences in 1977, the Institut de
France in 1978, and the Norwegian Academy of Sciences in 1979. He
received the Nicholas Murray Butler Gold Medal from Columbia
University in 1970, the F. Polacky Gold Medal in 1991, and the Charles
University Gold Medal in 1993. He was recipient of the Rolf Schock
Prize in 1993, and the Kyoto Prize in 1996.

Quine, a frequent contributor to this JOURNAL, has been variously
described as “the foremost representative of naturalism in the second
half of the twentieth century”, “probably the most important American
philosopher” since World War II, “our most distinguished systematic
philosopher”, and “one of America’s preeminent thinkers.”